Draft version April 5, 2013 

Preprint typeset using L^T^X style cmulatcapj v. 5/2/11 



CLASH: COMPLETE LENSING ANALYSIS OF THE LARGEST COSMIC LENS MACSJ0717.5- 

SURROUNDING STRUCTURES 



3745 AND 



o 

U 

Oh 
< 

o 
u 

Oh 

o 

in 



Elinor Medezinski 1 , Keiichi Umetsu 2 , Mario Nonino 3 , Julian Merten 4 , Adi Zitrin 5 , Tom Broadhurst 6,7 ' 2 , Megan 
Donahue 8 , Jack Sayers 9 , Jean-Claude Waizmann 10 , Anton Koekemoer 11 , Dan Coe 11 , Alberto Molino 12 , Peter 
Melchior 13 , Tony Mroczkowski 4,9 ' 14 , Nicole Czakon 9 , Marc Postman 11 , Massimo Meneghetti 10 , Doron Lemze 1 , 

Holland Ford 1 , Claudio Grillo 15 , Daniel Kelson 16 , Larry Bradley 11 , John Moustakas 17 , Matthias 

Bartelmann 5 , Narciso Benitez 12 , Andrea Biviano 3 , Rychard Bouwens 18 , Sunil Golwala 9 , Genevieve Graves 19 , 

Leopoldo Infante 20 , Yolanda Jimenez-Teja 6 , Stephanie Jouvel 21 , Ofer Lahav 22 , Leonidas Moustakas 4 , Sara 

Ogaz 11 , Piero Rosati 23 , Stella Seitz 24 , and Wei Zheng 1 

Draft version April 5, 2013 

ABSTRACT 

The galaxy cluster MACS J0717. 5+3745 [z = 0.55) is the largest known cosmic lens, with complex 
internal structures seen in deep X-ray, Sunyaev-Zel'dovich effect and dynamical observations. We 
perform a combined weak and strong lensing analysis with wide-field BVR c i' z' Subaru/Suprime-Cam 
observations and 16-band Hubble Space Telescope observations taken as part of the Cluster Lensing 
And Supernova survey with Hubble (CLASH). We find consistent weak distortion and magnification 
measurements of background galaxies, and combine these signals to construct an optimally estimated 
radial mass profile of the cluster and its surrounding large-scale structure out to 5Mpc//i. We find 
consistency between strong-lensing and weak-lensing in the region where these independent data 
overlap, < 500 kpc//i. The two-dimensional weak-lensing map reveals a clear filamentary structure 
traced by distinct mass halos. We model the lensing shear field with 9 halos, including the main cluster, 
corresponding to mass peaks detected above 2.5<7 K . The total mass of the cluster as determined by the 
different methods is M v - lv ss (2.8 ± 0.4) x 10 15 Mq. Although this is the most massive cluster known 
at z > 0.5, in terms of extreme value statistics we conclude that the mass of MACS J0717. 5+3745 by 
itself is not in serious tension with ACDM, representing only a ~ la departure above the maximum 
simulated halo mass at this redshift. 
Subject headings: cosmology: observations - - dark matter - - galaxies: clusters: individual 

(MACS J0717. 5+3745) -- gravitational lensing: weak -- gravitational lensing: 
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1. INTRODUCTION 

In hierarchical structure formation theories massive 
clusters are formed relatively recently and are still grow- 
ing through the accretion of substructure. Accretion 
is predicted to occur preferentially along filaments with 
clusters at the nodes of intersection (Bond et al. 1996), 
a pattern which is now clearly visible in densely sam- 
pled large redshift surveys (Colless et al. 2001; Huchra 
et al. 2005), all-sky optical surveys such as the Sloan 
Digital Sky Survey (SDSS; Tegmark et al. 2004) and the 
Baryon Oscillation Spectroscopic Survey (BOSS; White 
et al. 2011) and large lensing shear surveys (e.g., Massey 
et al. 2007a; Van Waerbeke et al. 2013). Increasing num- 
bers of clusters caught in the act of merging are found 
amongst the most luminous X-ray sources (Ebcling et al. 
2007) and strongest Sunyaev-Zel'dovich effect (SZE) sig- 
nal (Planck Collaboration et al. 2011; Marriage et al. 
2011; Vandcrlinde et al. 2010). Selection effects are now 
understood to strongly favor the detection of gas com- 
pressed or shocked during cluster collision, as illustrated 
by hydrodynamical simulations (Ricker & Sarazin 2001; 
Burns et al. 2008; Molnar et al. 2012). 

Large simulations of the growth of structure in the 
context of the standard cold dark matter (ACDM) (Ko- 
matsu et al. 2009) cosmological model have generated 
increasingly accurate predictions for the evolution of the 
cluster mass function, extending to a limiting halo mass 
of approximately 2 x 1O 15 M (Neto et al. 2007; Duffy 
et al. 2008; Zhao et al. 2009; Bhattacharya et al. 2011). 
The number density of very massive clusters is predicted 
to change relatively rapidly at low redshift, z < 1.0, 
where the evolution is principally sensitive to the cos- 
mological matter density, Q m . To second order one 
may hope to examine the constancy with redshift ex- 
pected for the "dark-energy" density (Allen et al. 2004; 
Mantz et al. 2008, 2010b; Schmidt & Allen 2007) and 
test for self-consistency of General Relativity (Rapetti 
et al. 2010). Presently, however, there are only indi- 
rect determinations of the masses of statistical samples 
of clusters, selected by X-ray means (Ebeling et al. 2000, 
2001; Vikhlinin et al. 2009a), and numbering only less 
than ~ 300 clusters, with masses mostly derived from 
uncertain X-ray scaling relations (Mantz et al. 2010a; 
Vikhlinin et al. 2006). Although these studies have not 
yet challenged the standard model (Mantz et al. 2010b; 
Vikhlinin et al. 2009b), much larger lensing-based sur- 
veys of clusters will eventually provide much more detail, 
in particular the wide area Subaru/Hyper Suprime-Cam 
survey (Takada 2010), and later Big-BOSS (Schlegel 
et al. 2009), LSST (Ivezic et al. 2008), Euclid (Laureijs 
et al. 2011) and WFIRST (Green et al. 2012). 

Despite the lack of lensing-based cluster mass func- 
tions, we may progress by exploring the most extreme 
clusters (Owcrs et al. 2011; Waizmann et al. 2012b; 
Colombi et al. 2011), in particular the most distant 
(Hoyle et al. 2011), because of the exponential sensitiv- 
ity of the cluster mass function to the growth of struc- 
ture (Bahcall et al. 1995; Chongchitnan & Silk 2012). 
Anomalously large masses have been claimed for clusters 
at z < 1.5 (Rosati et al. 2009; Jee et al. 2009; Santos 
et al. 2011; Foley et al. 2011; Tozzi et al. 2013). At 
lower redshifts, accurate full strong+weak lensing total 
masses were measured for the massive clusters such as 



Abell370 (z = 0.375) and RX J1347.5 (z = 0.45), of or- 
der of w 2- 3 x 10 15 M Q (Broadhurst et al. 2008; Umetsu 
et al. 2011b). These clusters were selected from all-sky 
X-ray surveys (Ebeling et al. 2000, 2001, 2010), so that 
although the masses are larger and their redshifts are 
lower, the degree of tension with ACDM is not extreme 
(Waizmann et al. 2012b). At the present time, no indi- 
vidual cluster has been uncovered that strains the credi- 
bility of the standard ACDM model. 

Mapping the mass distribution in clusters has provided 
insight into the physics that govern dark matter (DM), 
the main mass component in the Universe whose nature 
is largely unknown. The iconic 'Bullet' cluster (Marke- 
vitch et al. 2002; Clowe et al. 2004) is an example of 
a post-merger cluster collision phase, where the diffuse 
gas component has been separated by ram-pressure from 
the DM and galaxies (Springel & Farrar 2007; Mastropi- 
etro & Burkcrt 2008), whereas lensing shows that the 
DM and the galaxies are still spatially coincident (Clowe 
et al. 2006). This system serves to show that DM is effec- 
tively collisionless in nature. Subsequently, several new 
examples of bullet-like clusters have since been discov- 
ered with large-scale supersonic shock fronts (Markevitch 
et al. 2005; Menanteau et al. 2012; Russell et al. 2010; 
Korngut et al. 2011; Macario et al. 2011; Owers et al. 
2011; Mcrten et al. 2011; van Weeren et al. 2012), which 
are hard to reconcile with the expected pairwise velocity 
distribution of colliding galaxy clusters, where relative 
impact velocities in excess of 2000 km/s are unlikely in 
the context of ACDM (Lee & Komatsu 2010; Thompson 
& Nagamine 2012). 

To shed new light on these mysteries of DM and test 
structure formation models with unprecedented preci- 
sion, the Cluster Lensing And Supernova survey with 
Hubble (CLASH, Postman et al. 2012) 26 , a 524-orbit 
Hubble Space Telescope (HST) multi-cycle treasury pro- 
gram, has been in progress to couple the lensing power of 
25 massive clusters (M vir = 5 — 30 x 1O 14 M , z mod = 0.4) 
with HST in 16 passbands with full UV/optical/IR cov- 
erage, complemented by Subaru wide-field imaging ca- 
pabilities (e.g., Umetsu et al. 201 la, b). Importantly, 20 
CLASH clusters were X-ray selected to be relatively re- 
laxed, in order to examine the concentration-mass rela- 
tion for a sample with no strong selection bias toward 
high concentration clusters. A further sample of 5 clus- 
ters were selected by their high lensing magnification 
properties, with the goal of detecting and studying high- 
redshift background galaxies magnified by the cluster po- 
tential. 

MACSJ0717.5+3745 (hereafter, MACSJ0717; z = 
0.5458) is one of the CLASH high-magnification clus- 
ters, and was originally detected by its X-ray emission, 
as part of the MAssive Cluster Survey (MACS, Ebcling 
et al. 2001) and independently in the radio (Edge et al. 
2003). The cluster has the highest X-ray temperature 
in the MACS sample (k B T x = 11.6 keV Ebeling et al. 
2007, also see Table 1), and has since been revealed as 
one of the most dynamically disturbed clusters known. 
In the core of this cluster a complex four-component 
merging activity has been inferred from optical, X-ray 
and internal dynamics (Ma et al. 2008, 2009). Low- 
frequency Radio observation reveal very complex diffuse 

26 http://www.stsci.edu/~postman/CLASH 
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radio emission (Edge et al. 2003; van Wceren et al. 2009; 
Bonafede et al. 2009) , indicative of a radio relic or halo, 
and thought to be a signature of major mergers and com- 
mon to many of the distant MACS clusters where large- 
scale supersonic shocks are found (Bonafede et al. 2012). 
High speed gas motion within MACSJ0717 is also ten- 
tatively inferred from dynamical data in a large spec- 
troscopic study, where structural components were de- 
fined on the basis of X-ray emission peaks (Ma et al. 
2009). This high relative velocity has been confirmed in 
the multi-frequency SZE measurements of Mroczkowski 
et al. (2012), which deviate from a thermal SZE spectrum 
in a way that is consistent with kinetic SZE emission. 

A strong-lensing (SL) analysis for this cluster has un- 
covered a complex elongated tangential critical curve en- 
compassing the central substructures (Zitrin et al. 2009a; 
Limousin et al. 2012). In terms of the critical area, 
this cluster has the largest strong lensing area, with 
an equivalent Einstein radius of, 8e = 55" ± 3" (at 
z s = 2.963), and an extremely large mass inside this 
region, M 2D (< 9 B ) ~ 7 X 1O 14 M (Zitrin et al. 2009a) 
was deduced. It has been argued that the size of the Ein- 
stein radius here may be inconsistent with ACDM(Zitrin 
et al. 2009a; Meneghetti et al. 2011). However, Waiz- 
mann et al. (2012c) showed that it can be explained as 
an extreme case in the context of the lens orientation 
or other effects. On the larger scale, MACSJ0717 was 
shown to be part of a filamentary structure from galaxy 
distributions (Ebeling et al. 2004), and from weak- lensing 
(WL) (Jauzac et al. 2012), possibly spanning 4Mpc in 
length. 

In this paper, we aim to quantify the complex mass 
properties of MACSJ0717 and its surrounding large-scale 
structure (LSS) by employing a comprehensive weak and 
strong lensing analysis based on deep, wide-field Subaru 
BVR c i'z' imaging, combined with our recent CLASH 
HST imaging. We use our methods to derive a robust to- 
tal mass estimate, and calculate meaningful constraints 
on the existence of such rare high-mass peaks in a ACDM 
cosmology. The paper is organized as follows. In Sec- 
tion 2, we describe the observational dataset, its reduc- 
tion and WL shape measurements. In Section 3 we de- 
scribe the selection of cluster and background galaxies 
for WL analysis. In Section 4 we present the WL anal- 
ysis using Subaru observation. In Section 5, we present 
improved SL analysis using our new CLASH HST ob- 
servations and in Section 6 we present a complementary 
WL analysis using the HST observations. In Section 7 we 
derive cluster mass profiles from lensing, combining SL 
with WL shear and magnification measurements, and in 
Section 8 we present the cluster mass distribution span- 
ning both large scales and zooming in on the core, and 
constrain individual mass peaks using a multi-halo mod- 
eling approach. In section 9, we discuss our lensing mass 
properties, compare with complementary X-ray and SZE 
measurements, and contrast the cluster total mass we 
derive with predictions from ACDM cosmology using ex- 
treme value statistics. Finally, a summary of our work is 
given in Section 10. 

Throughout this paper, we use the AB magnitude sys- 
tem, and adopt a concordance ACDM cosmology with 
ti m = 0.3, tt A = 0.7, and H = 100/ikms" 1 Mpc" 1 
with h = 0.7. In this cosmology, 1' corresponds to 



268kpc/i~ 1 = 383 kpc at the cluster redshift, z = 
0.5458. All quoted errors are 68.3% confidence limits 
(CL) unless otherwise stated. The center is taken as the 
mean location of red-sequence selected cluster members, 
R.A.=07:17:32.63, Dcc.=+37:44:59.7 (J2000.0). 

2. SUBARU+CFHT OBSERVATIONS 

In this section we present the data reduction and anal- 
ysis of MACSJ0717 based on deep Subaru+CFHT multi- 
color images (Section 2.1). We briefly describe our WL 
shape measurement procedure in Section 2.2. 

2.1. Data Reduction and Photometry 

We analyze deep BVR c i'z' images of MACSJ0717 ob- 
served with the wide-field camera Suprime-Cam (34' x 
27'; Miyazaki et al. 2002) at the prime focus of the 8.3-m 
Subaru telescope. We observed the cluster on the night 
of 2010-03-17, in B,Rc, z' , to augment shallower obser- 
vations that existed in the Subaru archive, SMOKA 27 . 
Some of the archival data for this cluster was taken as 
part of the 'Weighing the Giants" program (von der Lin- 
den et al. 2012). The seeing FWHMs in the co-added mo- 
saic images are 0.95" in B (3.84 ks), 0.69" in V (2.16ks), 
0.79" in R c (2.22 ks), 0.96" in i' (0.45 ks), and 0.85" in z' 
(5.87 ks) with 0.20" pixel -1 , covering a field of approxi- 
mately 36' x 34'. The limiting magnitudes are obtained 
as B = 26.5, V = 26.5, R c = 26.2, I c = 26.0, and 
z' = 25.0 mag for a 3cr limiting detection within a 2" 
diameter aperture. 

To improve the accuracy of our photometric red- 
shifts, we also include UV observations from the 
Megaprime/MegaCam in the it* -band and Near-IR ob- 
servations from the WIRCAM in the J, Z4T s -bands on the 
CFHT telescope, available from the CFHT archive 28 . 
Although shallower, and in the case of NIR data the cov- 
erage is only of the inner 15' x 15', these extra bands add 
important information to help better constrain the SED 
of galaxies with degenerate fits. 

The observation details of MACSJ0717 are listed in 
Table 2. Figure 1 shows a u*BVR c i'z' composite color 
image of the cluster central 28' x 28', produced auto- 
matically using the publicly available Trilogy software 
(Coe et al. 2012). 29 . We overlay it with the DM map 
determined from WL (white contours, see Section 8) and 
the smoothed X-ray luminosity map (red contours, see 
Section 9.2). 

Our reduction pipeline derives from Nonino et al. 
(2009) and SDFRED (Ouchi et al. 2004; Yagi et al. 2002) 
and has been optimized for accurate photometry and WL 
shape measurements. Standard reduction steps include 
bias subtraction, flat-field correction (super-flat averaged 
from all exposures of the same night where objects have 
been masked) and point-spread function (PSF) matching 
between exposures in the same band (if PSF variation is 
evident) as we normally use multi-epoch images taken 
under different conditions. Masking of saturated star 
trails and other artifacts is then applied. 

To obtain an accurate astrometric solution for Subaru 
observations, we retrieved processed MegaCam r-band 

27 http://smoka.nao.ac.jp 

28 This research used the facilities of the Canadian Astronomy 
Data Centre operated by the National Research Council of Canada 
with the support of the Canadian Space Agency 

29 http://www.stsci.edu/~dcoe/trilogy/ 
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Figure 1. 28' X 28' CFHT u*+Subaru BVR c i'z' composite color image showing the galaxy cluster MACSJ0717 (z = 0.548). Overlaid 
are the surface mass density map reconstructed from our Subaru WL analysis (white contours) and the X-ray brightness map from 
XMM-Newton observations (red contours). North is up and east is to the left. 



(Filter Number: 9601) images from the CFHT archive 
and used it as a wide-field reference image. A source 
catalog was created from the co-added MegaCam r im- 
age, using the 2MASS catalog 30 as an external reference 
catalog. The extracted r catalog has been used as a ref- 

30 This publication makes use of data products from the Two 
Micron All Sky Survey, which is a joint project of the Univer- 
sity of Massachusetts and the Infrared Processing and Analysis 
Center/California Institute of Technology, funded by the National 
Aeronautics and Space Administration and the National Science 
Foundation 



erence for the SCAMP software (Bcrtin 2006) to derive 
an astrometric solution for the Suprime-Cam and other 
CFHT images. We do not use the CFHT r band image 
in our photometry as the band is overlapping the Subaru 
i?C band, but with much lower resolution and depth, and 
therefore does not carry added information. 

For an accurate measure of photometry, we first smear 
the single exposures of the same band to the worst seeing. 
This step is done by using SDFRED/psfmatch proce- 
dure, after suitable point-like sources have been previ- 
ously selected. The WL band is chosen to be the band 
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with a combination of best seeing and deepest observa- 
tion, i?c in this case (see table 2). 

Finally, the Swarp software (Bcrtin ct al. 2002) is uti- 
lized to stack the single exposures on a common WCS 
grid with pixel-scale of 0.2" using the accurate registra- 
tion that was achieved in the previous step. This as- 
sures minimal distortion of the final image. Note, for the 
WL band we stack separately data collected at different 
epochs and different camera rotation angles. 

The photometric zero-points for the co-added Suprimc- 
Cam images were derived from a suitable set of reference 
stars identified in common with the calibrated Mega- 
Cam data. These zero-points were refined in two inde- 
pendent ways: first, by comparing with the HST/ ACS 
magnitudes of cluster elliptical-type galaxies, translated 
to Subaru magnitudes using an elliptical SED (spectral 
energy distribution) template with the BPZ code; sub- 
sequently, by fitting SED templates with the BPZ code 
(Bayesian photometric redshift estimation, Bem'tez 2000; 
Benitez et al. 2004) to Subaru photometry of 12 galax- 
ies having measured spectroscopic redshifts from the lit- 
erature 31 (Limousin et al. 2012) and calculating model 
magnitudes. This leads to a final photometric accuracy 
of ~ 0.01 mag in all passbands (see also Section 3.3). 

The eight-band u* BV R c i' z' J K$ photometry catalog 
was then measured using SExtractor (Bcrtin & Arnouts 
1996) in dual-image mode on PSF-matched images cre- 
ated by ColorPro (Coe et al. 2006), where a combination 
of B + V + R + z' bands was used as a deep detection 
image (we exclude the %' band which is of lesser quality) . 
The stellar PSFs were measured from a combination of 
100 stars per band and modeled using IRAF routines. 

2.2. Subaru Shape Measurement 

For shape measurements, we use our well-tested 32 WL 
analysis pipeline based on the IMCAT package (Kaiser 
et al. 1995, KSB hereafter), incorporating modifications 
and improvements developed and outlined in Umetsu 
ct al. (2010). Our KSB+ implementation has been ap- 
plied extensively to Subaru cluster observations (e.g., 
Broadhurst ct al. 2005a, 2008; Umetsu ct al. 2007; 
Umetsu k Broadhurst 2008; Okabe & Umetsu 2008; 
Umetsu et al. 2009, 2010, 2011b,a; Medezinski et al. 2010, 
2011; Zitrin et al. 2011; Coe et al. 2012; Umetsu et al. 
2012; Zitrin et al. 2013). Full details of our CLASH WL 
analysis pipeline are presented in Umetsu et al. (2012). 

In this analysis we use the i?c _ band data taken in 2005 
and 2010, which have the best image quality in our data- 
sets, taken in fairly good seeing conditions. Two separate 
co-added i?c-band images are created, one from 2005 

31 This research has made use of the NASA/IPAC Extragalactic 
Database (NED) which is operated by the Jet Propulsion Labo- 
ratory, California Institute of Technology, under contract with the 
National Aeronautics and Space Administration. 

32 Our analysis pipeline has been tested using simulated Subaru 
Suprime-Cam images (see Section 3.2 of Oguri et al. 2012; Massey 
et al. 2007b). We can recover the WL signal with good precision, 
typically, \m\ < 5% of the multiplicative calibration bias (as de- 
fined by the STEP project: see Heymans et al. 2006; Massey et al. 
2007b), where the range of m- values represents a modest depen- 
dence of calibration accuracy on seeing conditions and PSF prop- 
erties, and c ~ 10 -3 of the additive residual shear offset, which is 
about one order of magnitude smaller than the typical distortion 
signal in cluster outskirts (\g\ ~ 10~ 2 ). This level of performance is 
comparable to other similarly well-tested methods (Heymans et al. 
2006; Massey ct al. 2007b). 



(with a total of 450 sec, observer Yasuda) and one from 
2010 (with a total of 2160 sec, observed by us on March 
2010). We do not smear the single exposures before 
stacking (as done for photometric measurements), so as 
not to degrade and destroy the WL information derived 
from the shapes of galaxies. A shape catalog is created 
for each epoch separately and the catalogs themselves 
are then combined by properly weighting and stacking 
the calibrated distortion measurements for galaxies in 
the overlapping region. The combination of both epochs 
increases the number of measured galaxy shapes and im- 
proves the statistical measurement, while not degrading 
the quality of the shape measurement due to different 
seeing and anisotropy at different observing epochs. 

3. SAMPLE SELECTION 

For an undiluted WL detection, we need to carefully 
select a pure sample of background galaxies. In order to 
further explore the distribution of the cluster galaxies, 
we also identify the cluster members population. We use 
the B, R c , z' Subaru imaging which spans the full optical 
wavelength range to perform color-color (CC) selection 
of cluster and background samples, as demonstrated by 
Medezinski et al. (2010) and detailed below. 

3.1. Cluster Sample Selection 

In Figure 2 we show the B — Rq vs Rq — z' distribution 
of all galaxies to our limiting magnitude (cyan) . To iden- 
tify our cluster-dominated area in this CC-space, we also 
plot up B — R c vs. Rq — z' only of galaxies with small 
projected distance R < 3' ( ,$ 1 Mpc at z/ = 0.546) from 
the cluster center. A region is then defined according to a 
characteristic overdensity in this space (shown as a solid 
green curve in Figure 2). Then, all galaxies within this 
distinctive region from the full CC diagram define the 
"green" sample (green points in Figure 2), comprising 
mostly the red-sequence of the cluster and a blue trail 
of later-type cluster members. We note that the small 
overdensity seen bluer (lower-left, B — RcRq — z' ~ 0.7) 
than our green sample does not lie at the same redshift 
of the cluster and is not a bluer population that is part of 
the cluster, but is in fact comprised of early- type galaxies 
lying in the foreground of the cluster, at about z ~ 0.33, 
which we will discuss further as part of our multi-halo 
mass modeling (Section 8.2). 

The number density profile of the green sample is 
steeply rising toward the center (Figure 6, green crosses). 
The low number density at large clustercentric radius 
is indicative of negligible contamination of background 
galaxies of this sample. The WL signal for this pop- 
ulation is found to be consistent with zero at all radii 
(Figure 5, green crosses), also indicating the reliability 
of our procedure. For this population of galaxies, we 
find a mean photometric redshift of (z p hot) ~ 0.56 (see 
Section 3.3), consistent with the cluster redshift. Impor- 
tantly, the green sample marks the region that contains a 
majority of unlensed galaxies, relative to which we select 
our background samples, as summarized below. 

3.2. Background Sample Selection 

A careful background selection is critical for a WL 
analysis so that unlensed cluster members and fore- 
ground galaxies do not dilute the true lensing signal 
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Figure 2. "Blue" and "red" background galaxies are selected 
for WL analysis (lower left blue dashed and right red dot-dashed 
contours, respectively) based on Subaru B,Rq,z' color-color- 
magnitude selection. All galaxies (cyan) are shown in the diagram. 
At small clustercentric radius, an overdensity of cluster galaxies de- 
fines our "green" sample (green solid contour), comprising mostly 
the red sequence of the cluster and some blue trail of later type clus- 
ter members. The background samples are well isolated from the 
green region and satisfy other criteria as discussed in Section 3.2. 

of the background (Broadhurst et al. 2005a; Medezin- 
ski et al. 2007; Umetsu & Broadhurst 2008; Medezin- 
ski et al. 2010). This dilution effect is simply to re- 
duce the strength of the lensing signal when averaged 
over a local ensemble of galaxies (by a factor of 2-5 
at R ^ 400kpcft.~ 1 ; see Figure 1 of Broadhurst et al. 
2005a), particularly at small radius where the cluster is 
relatively dense, in proportion to the fraction of unlensed 
galaxies whose orientations are randomly distributed. 

We use the background selection method of Mcdczin- 
ski et al. (2010) to define undiluted samples of back- 
ground galaxies, which relies on empirical correlations for 
galaxies in color-color-magnitude space derived from the 
deep Subaru photometry, by reference to evolutionary 
tracks of galaxies (for details, see Medezinski et al. 2010; 
Umetsu et al. 2010) as well as to the deep photometric- 
redshift survey in the COSMOS field (Ilbert et al. 2009). 

For MACSJ0717, we have a wide wavelength coverage 
(BVR c i'z r ) of Subaru/Suprime-Cam. We therefore make 
use of the (B — R c ) vs. (R c — z') CC-diagram to care- 
fully select two distinct background populations which 
encompass the red and blue branches of galaxies. We 
limit the red sample to z' < 25 mag in the reddest band, 
corresponding approximately to a 5c limiting magnitude 
within 2" diameter aperture. We extend the magnitude 
limit of the blue samples further to z' < 26 mag, where 
the number density of galaxies grows significantly higher, 
especially for bluer galaxies whose faint-end slope of the 
luminosity function is rising, giving a much improved WL 
statistical measurement. 

For the background samples, we define conservative 
color limits, where no evidence of dilution of the WL 
signal is visible, to safely avoid contamination by un- 
lensed cluster members and foreground galaxies. The 
color boundaries of our "blue" and "red" background 
samples are shown in Figure 2. For both the blue and 
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Figure 3. Redshift distributions of CC-selected green, red and 
blue samples using the BPZ photo-z's based on Subaru+CFHT 
imaging. The cluster redshift is marked with a black line. 

red samples, we find a consistent, rising WL signal (see 
Section 4.1) all the way to the center of the cluster, as 
shown in Figure 5. 

As a further consistency check, we also plot in Fig- 
ure 6 the galaxy surface number density as a function of 
clustercentric radius, n(9), for the blue and red samples. 
As can be seen, no clustering is observed toward the cen- 
ter for the background samples, which demonstrates that 
there is no significant contamination by cluster mem- 
bers in these samples. The red sample systematically 
decreases in projected number density toward the clus- 
ter center, caused by the lensing magnification effect. A 
more quantitative magnification analysis is given in Sec- 
tion 4.2. 

To summarize, our CC-selection criteria yielded a to- 
tal of N=10490, 1252, and 11998 galaxies, for the red, 
green, and blue photometry samples, respectively (see 
Table 3). For our WL distortion analysis, we have a sub- 
set of 4856 and 4738 galaxies in the red and blue samples 
(with usable Rc shape measurements), respectively (see 
Table 4). 

3.3. Depth Estimation 

The lensing signal depends on the source redshift z s 
through the distance ratio /3(z s ) — D\ s /D s , where D\ s , D s 
are the angular diameter distances between the lens and 
the source, and the observer and the source, respectively. 
We thus need to estimate and correct for the respective 
depths (/3) of the different galaxy samples, when convert- 
ing the observed lensing signal into physical mass units. 

For this we utilize BPZ to measure photometric red- 
shifts (photo-zs) Zphot using our deep Subaru+CFHT 
u* BVR c i'z', J K s photometry (Section 2.1). BPZ em- 
ploys a Bayesian inference where the redshift likelihood 
is weighted by a prior probability, which yields the prob- 
ability density P(z, T\m) of a galaxy with apparent mag- 
nitude ?n of having certain redshift z and spectral type 
T. In this work we used a new library (N. Benitez 
2012, in prep.) composed of 10 SED templates originally 
from PEGASE (Fioc & Rocca-Volmerange 1997) but re- 
calibrated using the FIREWORKS photometry and spec- 
troscopic redshifts from Wuyts et al. (2008) to optimize 
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Figure 4. Lensing depth (D\ S /D B ) as a function of Subaru z'- 
band limiting magnitude for the red and blue background samples, 
as estimated from the photometric redshifts of COSMOS to z' < 25 
(circles). In order to estimate the depth of the blue sample to its 
limiting magnitude of z' < 26 we extrapolate the curve (x's). 

its performance. This library includes five templates for 
elliptical galaxies, two for spiral galaxies, and three for 
starburst galaxies. In our depth estimation we utilize 
BPZ's ODDS parameter, which measures the amount of 
probability enclosed within a certain interval Az centered 
on the primary peak of the redshift probability density 
function (PDF), serving as a useful measure to quantify 
the reliability of photo- z estimates. 

We only consider galaxies from the WL-matched cat- 
alogs, as those are the galaxies from which we estimate 
the lensing signal and finally the mass profile. To make 
this estimate more robust, we use galaxies for which the 
photo-z was determined using all available 8 bands. We 
show the normalized redshift distribution of these galax- 
ies in each of the green, red and blue samples in Figure 3. 
Still, since only 12 spectroscopic redshifts are publicly 
available in this field, it is difficult to estimate the reli- 
ability of our photo-z's. We therefore further compare 
our results with the more reliably estimated depths we 
derive from the COSMOS catalog (Ilbert et al. 2009), 
which has robust photometry and photo-z measurements 
for the majority of galaxies with i' < 25 mag. For each 
sample, we apply the same CC selection to the COS- 
MOS photometry and obtain the redshift distribution 
N(z) of field galaxies. Since COSMOS is only complete 
to i' < 25 mag, we derive the mean depth as a function 
of magnitude (fig. 4) up to that limit, and extrapolate to 
our sample limiting magnitude, z' — 25 in the case of the 
red sample, and z' = 26 in the case of the blue sample. 

For each background population, we calculate a 
weighted mean of the distance ratio j3 (mean lensing 
depth) as 

fdzw(z)N(z)/3(z) 

[P) Jdzw(z)N(z) ' 



(1) 



where w(z) is a weight factor; w is taken to be the 
Bayesian ODDS parameter for the BPZ method, and 
w = 1 otherwise. The sample mean redshift (z s ) is de- 
fined similarly to Equation (1). 



In Table 4 we summarize the mean depths (0) and 
the effective source redshifts z a ,e& f° r our background 
samples. For each background sample, we obtained con- 
sistent mean-depth estimates ((3) (within 2%) using the 
BPZ- and COSMOS-based methods. In the present 
work, we adopt a conservative uncertainty of 5% in the 
mean depth for the combined blue and red sample of 
background galaxies, (/3(back)) = 0.48 ±0.03, which cor- 
responds to z Sj0 ff = 1.26 ± 0.1. We marginalize over this 
uncertainty when fitting parametrized mass models to 
our WL data. 

4. SUBARU WEAK-LENSING ANALYSIS 

In this section we describe the WL analysis based on 
our deep Subaru imaging data. In Section 4.1 we derive 
cluster lens distortion and in Section 4.2 we derive the 
magnification radial profiles from the data. 

4.1. Tangential Distortion Analysis 

The shape distortion of an object is described by the 
complex reduced-shear, g = g\ + ig2, where the reduced- 
shear is defined as (in the subcritical regime; see, e.g., 
Bartelmann & Schneider 2001), 

9a = 7a/(l ~ «), (2) 

where 7 is the complex gravitational shear field and is 

non-locally related to the convergence, k — £/E cr j t , 

which is the surface mass density in units of the criti- 

2 -. 

cal surface-mass density for lensing, £ cr it = 4 qj; @~ ■ 

The tangential component of the reduced-shear, g + , is 
used to obtain the azimuthally-avcragcd distortion due 
to lensing, and computed from the distortion coefficients 
(5i,ff2): 

9+ = — (.9i cos 29 + gi sin 29) , (3) 

where 9 is the position angle of an object with respect 
to the cluster center. The uncertainty in the object g + 
measurement is a+ = a g j\j2 = a in terms of the RMS er- 
ror o~ g for the complex reduced-shear measurement. For 
each galaxy, o~ g is the variance for the reduced shear esti- 
mate computed from 50 neighbors identified in the r g -Rc 
plane. To improve the statistical significance of the dis- 
tortion measurement, we calculate the weighted average 
of 9+ 



9+,: 



9+ifii) 



J2 u '( fe ) 9+W 

.kei 



E 



w 



(k) 



(4) 



where the index k runs over all objects located within 
the ith radial bin with a weighted center of 9i, and w^k) 
is the weight for the k-th object, 



'j( fc ) = l/(o- 2 g(k) +a 2 ), 



(5) 



where a 2 is the softening constant variance. We choose 
a = 0.4, which is a typical value of the mean RMS a g 
over the background sample. The uncertainty in g + ^ 
is calculated from a bootstrap error analysis (for de- 
tails, see Umetsu et al. 2012). Since WL only induces 
curl-free tangential distortions, the 45 deg-rotated com- 
ponent, g-K = — (52 cos 2(f) — gi sin 2(f) , is expected to van- 
ish. It is therefore useful as a check for systematic errors. 
In Fig. 5 we plot the radial profile of <?+ of the green, 
red and blue samples defined above. The black points 
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represent the red+blue combined sample (also shown in 
Figure 8, upper panel), showing the best estimate of the 
lensing signal, which is detected at 11.6cr significance 
over the full radial range. The red and blue samples 
profiles rise continuously toward the center of the clus- 
ter, and agree with each other within the errors, except 
at the very central bin, 9^2', where measurement are 
approching the non-linear regime, given the extremely 
elliptical shape of the tangential critical curve (see Fig- 
ure 1 of Zitrin et al. 2009a). The overall rising trend and 
agreement demonstrate that both the red and blue sam- 
ples are dominated by background galaxies and are not 
contaminated by the cluster at all radii. The g+ profile of 
the green sample agrees with zero at all radii. The mea- 
sured zero level of tangential distortion reinforces our CC 
selection of the green sample to consist of mostly cluster 
members. 
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Figure 5. Azimuthally-averaged radial profiles of the tangential 
reduced-shear </+ (upper panel) and the 45° rotated (x) compo- 
nent g x (lower panel) for our red (triangles), blue (circles), green 
(crosses), and blue+red (squares) galaxy samples. The symbols for 
the red and blue samples are horizontally shifted for visual clarity. 
For all of the samples, the X -component is consistent with a null 
signal detection well within 2<r at all radii, indicating the reliability 
of our distortion analysis. 



4.2. Magnification-Bias Analysis 

We follow the prescription of Umetsu et al. (2011b) to 
measure the magnification bias signal as a function of 
distance from the cluster center, which depends on the 
intrinsic slope of the luminosity function of background 
sources s, as 



,(0) = no/i(0 



,2.5s-l 



(6) 



where n = dN (< m cut )/d£l is the unlensed mean num- 
ber density of background sources for a given magni- 
tude cutoff TO cut , approximated locally as a power-law 
cut with slope, s — d\og 10 N (< m)/dm > 0. We use 
the red sample of background galaxies (Section 3.2), for 
which the intrinsic count slope s at faint magnitudes is 
relatively flat, s ~ 0.1, so that a net count depletion 
results (Broadhurst et al. 2005a; Umetsu & Broadhurst 
2008; Umetsu et al. 2010, 2011b). In contrast, the blue 
background population has a steeper intrinsic count slope 
close to the lensing invariant slope (s = 0.4). 
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Figure 6. Surface number density profiles n(8) of Subaru BR c z'- 
selected samples of galaxies. The red (triangles) and blue (circles) 
samples comprise background galaxies, and the green (crosses) 
sample comprises mostly cluster galaxies, as is evident by their 
steeply rising number counts toward the center. See also Figure 8. 

Here we use the approach developed in Umetsu et al. 
(2011b) to measure the azimuthally-averaged surface 
number density profile of the red galaxy counts, n^.i = 
n^{0i) (red triangles in Fig. 6), taking into account and 
correcting for masking of background galaxies due to 
bright cluster galaxies, foreground objects, and saturated 
objects. The errors <t„ , for n^ ti include both contribu- 
tions from Poisson errors in the counts and contamina- 
tion by intrinsic clustering of red background galaxies. 
The normalization and slope parameters for the red sam- 
ple are reliably estimated outside the lensed region, by 
virtue of the wide-field imaging with Subaru/Suprimc- 
Cam. We find no = 13.3 ± 0.3galaxies/acrmin 2 and 
s = 0.123 ±0.048. 

We show in the bottom panel of Figure 8 the measured 
magnification profile from our flux-limited sample of red 
background galaxies (z' < 25 mag; see Table 3) with and 
without the masking correction applied (red circles and 
green crosses, respectively). A clear depletion of the red 
counts is seen in the central, high-density region of the 
cluster and detected out to £ 4' from the cluster center. 
The radially integrated significance of the detection of 
the depletion signal is 5.3er. 

The high-level WL shear and magnification profile we 
derived here can be combined together to reconstruct the 
underlying mass profile. However, to better resolve the 
core of the profile, we require further constraints which 
are enabled by SL. We therefore derive the SL mass pro- 
file in the next section. 

5. HST STRONG LENSING ANALYSIS 

For a massive cluster, the weak- and strong-regimes 
contribute similar logarithmic coverage of the mass pro- 
file. Hence, the central SL information is crucial in a 
cluster lensing analysis (Umetsu et al. 2011b, a, 2012). In 
this section we derive a SL model to compare with our 
WL profile in the overlap region, and to combine with 
WL in deriving the mass reconstruction in section 7. 

First we summarize our well-tested approach to strong- 
lens modeling, developed by Broadhurst ct al. (2005b) 
and optimized further by Zitrin et al. (2009b) (see also 
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Zitrin et al. 2013). Briefly, the adopted parametrization 
is as follows. Cluster members, chosen by a F814W- 
F555W color criterion, are each represented by a power- 
law mass density profile. The superposition of all galaxy 
contributions constitutes the galaxy, lumpy component 
for the model. This component is then smoothed using 
a 2D spline interpolation to comprise the DM compo- 
nent. The two components are then added with a rel- 
ative weight. In order to allow further degrees of free- 
dom (DOF), and higher effective ellipticity of the crit- 
ical curves, an external shear is added. In total, the 
method thus includes six basic free parameters: (1) the 
power-law of the galaxy mass profile; (2) the smoothing 
(polynomial) degree of the DM component; (3) the rela- 
tive weight of the galaxy to the DM component; (4) the 
overall scaling or normalization; (5) the amplitude and 
(6) angle of the external shear (for more details see Zitrin 
et al. 2009b). 

Using this method, in Zitrin et al. (2009a) we per- 
formed the first SL analysis of this cluster using 3- 
band publicly available HST imaging, and found 34 
multiple-images from 13 lensed sources, uncovering that 
MACSJ0717 is the largest known lens. As part of 
CLASH (Postman et al. 2012), we further observed 
MACSJ0717 with HST in 16 filters with the ACS and 
WFC3/UVIS+IR cameras. We thus revise here our pri- 
mary multiple image identification, and follow in general 
the multiple image sets listed in Table 1 of Limousin et al. 
(2012, hereafter L12), who recently used Zitrin et al. 
(2009a) 's systems (with the exclusion of three systems, 
9-11) and added five additional multiple systems uncov- 
ered in the publicly-available CLASH data. Although 
we agree with most of the identifications and revisions 
by L12, we determine image 1.5 to be at a different loca- 
tion. In addition, although the model suggests these may 
be counter images of the same source, we omit system 2 
from our list, since it strongly deviates from the D\ s / D a 
scaling relation expected, and as determined from visual 
inspection. Aside from these corrections, we use in to- 
tal 43 multiple-images coming from the other 14 sources 
listed in L12 as our SL constraints here. Internal bright 
knots in some of the images are added as further con- 
straints. 

We use a several dozen thousand step Monte-Carlo 
Markov-Chain (MCMC) minimization in order to find 
the best-fit solution, defined by the image-plane repro- 
duction x 2 ■ The advantage of this light-traces-mass 
(Zitrin et al. 2009a) method is that even very complex 
systems such as MACSJ0717 are still well fitted by this 
simple procedure, although it may not be expected to 
yield an RMS as low as in a multi-halo/parameteric fit 
(e.g. L12). Note however, that even with a somewhat 
higher RMS, the representation is still highly credible 
as it allowed the identification of many multiple image 
systems (Zitrin et al. 2009a). 

Here, in practice, to allow for more freedom and a bet- 
ter RMS, we also leave the relative weight of 10 galaxies 
to be freely optimized by the MCMC. With this, the 
final model we present here has an RMS of 3.86" and 
a x 2 °f — 334 (with a location error of a — 1.4"), over 
32 DOF. The relatively large reduced x 2 (compared with 
the reasonable RMS) may indicate that in such a complex 
system, a position error of 1.4" may be underestimated, 



not taking into account stronger LSS and complexity ef- 
fects. Since the best-fit statistical solution defined by the 
image positions may not always reproduce the multiple 
images with the right internal shape or orientation, mul- 
tiple images are then sent to the source plane and back 
through the lens to test, by eye, the reproduction of other 
multiple images of the same systems (e.g., Zitrin et al. 
2009a, b). Note that only 5 systems have spectroscopic 
redshift, so that we use the input from L12 as the pre- 
dicted redshift of the other systems. We do not leave any 
of these redshifts to be optimized by the model, which 
may have lowered further the RMS of our model. 

We present the azimuthally-averaged projected mass 
density profile from the resulting SL model of 
MACSJ0717 in Figure 13 (green curve). The density 
profile shows a remarkably flat core out to ^ 180kpc//i, 
as was noted in previous SL analyses of this cluster 
(Zitrin et al. 2009a). The shallow profile is in accor- 
dance with the non-relaxed appearance of this cluster 
and reported multiple mass clumps at the core of the 
cluster. According to our SL model, the total projected 
mass enclosed within a radius of 60 ± 6", the effec- 
tive Einstein radius at z s — 2.963, is A^2d(< 60") = 
(4.87 ± 0.35) x 10 14 M Q /h, which is in good agreement 
with the mass enclosed within the tangential critical 
curve, M 2D = (5.5 ±0.35) x 10 14 M /h. We now turn to 
the WL analysis in the next sections in order to recover 
the mass over the full scale of the cluster. 

6. HST WEAK-LENSING ANALYSIS 

In order to further constrain the mass profile in the 
cluster center, where the small Subaru number density 
leads to large uncertainties, we preform a complemen- 
tary WL analysis of the HST 16-band data in the weak 
regime. Details of our reduction, photometry and photo- 
z pipeline were given in previous papers (Postman et al. 
2012; Zitrin et al. 2012a; Coc et al. 2012). Here we 
further produced specialized drizzled images, optimized 
for WL, consisting of drizzling each visit in the "unro- 
tated" frame of the ACS/WFC3 detectors, using a mod- 
ified version of the " Mosaicdrizzle" pipeline (described 
more fully in Koekemoer et al. 2011). This allows ac- 
curate PSF treatment that does not compromise the in- 
trinsic shape measurements required by WL pipelines. 
The RRG (Rhodes et al. 2000) WL shape measurement 
package was then used to measure shapes in each of six 
ACS bands (F435W, F475W, F625W, F775W, F814W 
and F850LP), and the DEIMOS (Mclchior et al. 2011) 
package was used to measure shapes in the WFC3/IR 
F160W band. We exclude objects with S/N < 10 and 
size < 0.1'. All the shape catalogs were then matched 
to the deep multi-band photometric catalog and merged 
where there was more than one shape measurement per 
object, using its S/N as input weight of each measure- 
ment, according to eq. 5 with a = 0.2 as the softening 
kernel in this case. 

Since the HST field has reliable photometric redshifts 
measured from 16 bands spanning UV to IR, here we 
rely on those for a secure selection of background galax- 
ies. We define our background sample as galaxies hav- 
ing 0.8 < z h < 4, z b , min > 0.6, z b , max < 5, 22.5 < 
mF625W < 27.5, where z\, is the bayesian photomet- 
ric redshift from BPZ, and Zb,min, zt>,max are the 68% 
lower and upper bound on the photometric redshift es- 
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Figure 7. As in Figure 5, we show the tangential distortion profile 
of the Subaru background sample (black squares), and compare 
with the ffi?T-derived background sample in the central R < 2.4' 
(magenta circles) but outside the critical lensing region, R > f.5'. 
As can be seen, these two complementary dataset give consistent 
WL signal at the region of overlap, whereas the HST points have 
much smaller errors and therefore provide better constraints. 

timate, respectively. To examine the light distribution, 
we chose an inclusive cluster-member sample based on 

|Zb — Zcl| < 0.1, Z b ,min > Zcl-0.2, Z b , max < Z c l + 0.2, 17 < 

r nF625W < 24.5. In order to do a simultaneous analysis 
of the WL signal in both the HST and Subaru fields we 
need to account for the different redshift distribution of 
the different populations these two datasets target. We 
do this by estimating the depth factor, /3(z), from the 
-ffSTphoto-z's. For the HST background catalog we esti- 
mate B ~ 0.525, about a ~ 10% increase in depth relative 
to the Subaru catalog, estimated at B ~ 0.48. We apply 
this relative correction to the HST catalog and scale it 
to match the Subaru catalog. 

A significant part of the HST region resides inside the 
tangential critical curve area, and so is super-critical. In 
order to avoid nonlinear effects in the WL analysis, we 
examine the tangential distortion profile only at the outer 
region of HST, 1.5 < 9 < 2.5', to the limit of the data. 
We present the results in Figure 7, where we overplot the 
inner two bins (magenta circles) from HST that overlap 
with the same region in the inner Subaru (black squares) 
profile. The two independent datasets show consistent 
WL shear signal within the uncertainties, although we 
note the HST signal does show a slightly decreased level. 
This may arise from nonlinear effects causing an underes- 
timation of the shear, or simple image edge effects due to 
different filter coverage. However, this level is still negli- 
gible and does not affect our results significantly. We will 
present the 2D distribution analysis of the HST region in 
section 8.4, after we first incorporate the .ffST'+Subaru 
WL with HST SL information to reconstruct the mass 
profile over the entire Subaru field of view (FOV) in the 
next section. 

7. RADIAL MASS PROFILE ANALYSIS 

7.1. Mass Profile Reconstruction using 
One- dimensional Shear and Magnification 

We derive the cluster mass profile as a function of 
clustercentric radius from a joint likelihood analysis 



Figure 8. Top: Tangential reduced shear profile g+ (9) (squares) 
based on HST and Subaru distortion data of the composite full 
background sample. Bottom: Coverage-corrected count profile 
n(8) (circles) for a flux-limited sample of red background galaxies 
registered in the Subaru BR c z' images. The horizontal bar repre- 
sents the constraints on the unlensed count normalization, no, as 
estimated from Subaru data. Also shown in each panel is the joint 
Bayesian reconstruction (68% CL; solid area) from SL, WL tan- 
gential distortion (squares), and magnification-bias measurements 
(circles). 
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Figure 9. Top: Surface mass density profile S(_R) (squares) de- 
rived from a joint SL, WL distortion and magnification (WL+SL) 
likelihood analysis of our HST+Sub&vu lensing observations. The 
gray area represents the best-fit NFW profile for the mass pro- 
file solution T,(R). Bottom: The cumulative mass M2d(< R) (red 
squares) as derived from the full lensing analysis. The gray area 
is the NFW fit (lcr confidence interval) to the WL+SL constraints 
as described above. 

of independent WL distortion (g+), magnification-bias 
(rift), and SL projected mass (m) constraints, {g+^i}^ 



AU 



(Section 4.1), {n^}"^i (Section 4.2), and {mj-^i 
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(Section 5), respectively, following the Bayesian ap- 
proach of Umctsu (2013), who extended the shear-and- 
magnification analysis method of Umctsu ct al. (2011b) 
to include the inner SL information. Such a multi- 
probe approach is critical for improving the accuracy and 
precision of the cluster lens reconstruction, effectively 
breaking the mass-sheet degeneracy (see Bartclmann & 
Schneider 2001; Umetsu et al. 2012). Adding SL informa- 
tion to WL is needed to provide tighter constraints on the 
inner density profile (e.g., Umetsu & Broadhurst 2008; 
Umctsu et al. 2011b, 2012). The shear+magnification 
method of Umetsu ct al. (2011b) has been extensively 
used to reconstruct the projected mass profile in a dozen 
clusters (Umetsu ct al. 2011b; Zitrin ct al. 2011; Coc 
ct al. 2012; Umctsu ct al. 2012; Zitrin ct al. 2013). In 
all cases, we find a good agreement between independent 
WL and SL mass profiles in the region of overlap. 

Briefly summarizing, the model is described by a vec- 
tor s of parameters containing the discrete convergence 
profile {Koo,i}iLi, given by N — N w \ + N s \ binned k val- 
ues, and the average convergence enclosed by the in- 
nermost aperture radius # m i n for SL mass estimates, 
^oo,min = ^oo(< ^min)i where we have introduced the 
convergence for a fiducial source in the far background 
of the cluster, Hod = n(9i]z s — > oo). The model s — 

{Koo^nin, K oo,i}^li is then specified by a total of (N + 1) 
parameters. Additionally we account for the uncertainty 
in the calibration parameters, c = {w g , w lll no, s), namely 
the population-averaged lensing strengths for the distor- 
tion and magnification measurements (see Table 4) , w g = 
(/3(back))//3(z s -> oo) and w^ = (/J(red)) / '/3(z 8 -> oo), 
the normalization and slope parameters (no, s) of the red- 
background counts (see Section 4.2). The covariance ma- 
trix Cij for the profile reconstruction is also constructed, 
and used for calculating the likelihood function of the 
combined WL+SL observations. 

In the present analysis, we calculate the g + and n^ 
profiles in JV W ] = 10 clustercentric radial bins, span- 
ning the range 9 = [1.5', 16'], with a constant loga- 
rithmic radial spacing A hi (9 ~ 0.237. Additionally, we 
use our projected mass measurement within a radius of 
60", m = M 2D (< 60") = (4.87 ±0.35) x 1O 14 M h~ x 
(N s \ — 1), tightly constrained by our detailed strong- 
lens modeling (Section 5). Note that, enclosed masses 
at the location around the Einstein radius (#Ein ~ 60" 
at z s = 2.963 here) are less sensitive to modeling as- 
sumptions and approaches (see Umetsu ct al. 2012), 
serving as a fundamental observable quantity in the SL 
regime (Coe et al. 2010). Hence, we have a total of 
Ntot = 27V w i + N s \ = 21 constraints. The mass pro- 
file model is described by N + 1 = 12 profile parameters 
and additional 4 calibration parameters (c) to marginal- 
ize over. 

The resulting mass profile s from a joint SL+WL likeli- 
hood analysis of our ifST+Subaru observations is shown 
in the top panel Figure 9 (red squares; see also Figure 13, 
red squares). We find a consistent mass profile solution 
s as displayed in Figure 8 (solid areas). The projected 
cumulative mass profile M%£>(< 9) is shown in the lower 
panel (red curve). It is given by integrating the density 



profile s = {k 



oo, mm ; ^00,2 



}i=i ( see Appendices A and B 



of Umctsu ct al. 2011b) as 
M 2D (< 6 t ) = 

7r(A0min) 2 S cri t«mm + 27r£>, 2 E crit / d\n9 9 2 k{9). 



(7) 



The total projected mass enclosed within a radius of 9 ss 
7' « 1.88Mpc//i is found to be M 2D = (2.9 ± 0.5) x 
10 15 M Q /h. 

As is evident from the cored density profile in the 
central region derived by SL, as well as from the flat- 
tened outer profile at large radii seen by WL (Figure 13), 
MACSJ0717 is a complex, non-relaxed cluster whose 
matter distribution is not well described by a single NFW 
profile. However, in order to derive a total (spherical) 
mass estimate of the main cluster component in a com- 
plementary (yet model-dependent) approach, we choose 
here to fit our full-lensing mass profile with a spherical 
NFW model. 

To that end, the projected radial mass profile s is fitted 
with a model consisting of a halo component described 
by the two-parameter universal NFW profile, knfw($)j 
and a constant mass-sheet component, n c : 



k(9) = knfw(#) + « 



(8) 



where the constant k c approximates the inherent "two- 
halo" term contribution due to the clustering of halos 
(see Oguri & Hamana 2011). The NFW mass density 
profile is given by the form 



PnfwM 



{r/r s )(l + r/r s y 



(9) 



where p s is the characteristic density, and r s is the char- 
acteristic scale radius at which the logarithmic density 
slope is isothermal. The halo virial mass is then given 
by integrating the NFW profile (Eq. 9) out to the virial 
radius r v i r , M v ; r = M(< r v ; r ). We specify the projected 
NFW model with the halo virial mass, M V - 1T , and the de- 
gree of concentration, c v ; r = r v [ r /r s . We refer all our 
virial quantities to an overdensity of A c = A v j r w 138 
based on the spherical collapse model using the fitting 
formula by Kitayama & Suto (1996, their Appendix A). 

We constrain the model parameters p = (M v i r , c v j r , k c ) 
with our full-lensing mass profile s. The \ 2 function for 
our SL+WL observations is 34 



X 



\p) = J> - h(p)]C^ l [ Sj - sjip)}, (10) 



where Sj is the model prediction for the convergence pro- 
file Si at 9i , C is the full covariance matrix of s defined as 
C = C + Ci ss with Ci ss being the cosmic covariance ma- 
trix responsible for the uncorrelated LSS projected along 
the line of sight. For details, see Umetsu et al. (2011a). 
Our best-fit NFW model to the combined lensing con- 
straints (Equation 10) is shown in Figure 9 as the gray 
shaded area. To summarize our results from this anal- 
ysis, we obtain a total virial mass estimate of M V j r = 

33 A v ; r « 140 using the fitting formula by Bryan & Norman 
(1998). 

34 The calibration uncertainties in observational parameters, 
such as the background mean depths, (/3(back)} (or z Sj eff) an d 
(jS(rcd)), have already been marginalized over in the Bayesian mass 
profile reconstruction. 
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(1.94±g;||) x 1O 15 M //i - (2.8 ±0.6) x 1O 15 M with 

the minimized x 2 of 10 for 9 DOF (see a summary in 
Table 5). We will compare and discuss all the mass esti- 
mates yielded by the different modeling and reconstruc- 
tion methods explored in the paper in Section 9.1. 

At larger radii, a flattening of the mass profile is ob- 
served, possibly indicative of the surrounding LSS. The 
deviation of the profile from a single spherical NFW halo 
at large radius is indicative of substructure associated 
with this cluster region, and therefore merits a more care- 
ful two-dimensional (2D) analysis, which we present in 
the next sections. 

7.2. Mass Profile Reconstruction using 
Two-dimensional Shear and Magnification 

We follow Umctsu et al. (2012) to extend the one- 
dimensional Bayesian method above (Section 7.1) into a 
2D mass distribution by combining the spatial shear pat- 
tern (gi(6) , 92(0)) with the azimuthally-averaged mag- 
nification measurements n^(0) (Section 4.2), imposing a 
set of azimuthally-integrated constraints on the underly- 
ing k(0) field. 35 For details of the method, we refer the 
reader to Umetsu et al. (2012, their Appendix A. 2). 

By combining complementary WL distortion and mag- 
nification data in a non-parametric manner, we construct 
a 2D mass map over a 48 x 48 grid with 0.5' spacing cov- 
ering the central 24' x 24' field. 36 We show in Figure 13 
(black circles) the azimuthally-averaged radial mass pro- 
file S(i?) produced from the resulting mass map, given 
in 10 linearly-spaced radial bins spanning from 9 = 1.5' 
to 16'. The innermost bin with the horizontal bar repre- 
sents the mean interior mass density S(< 1.5') as marked 
at the area-weighted center 8 = 1'. We find a model- 
independent constraint on the total enclosed mass within 
9 w 7' to be M 2D = (3.3 ± 0.6) x 10 15 M Q /h. 

We fit an NFW + mass-sheet model (Equation 8) to 
the WL radial mass profile derived here by minimiz- 
ing the total x 2 function defined as in Equation (10), 
marginalizing over the mean background depth uncer- 
tainty in z S) eff. We find the total virial mass to be 
Af vir = {2.15±° ID x lO ls M //i. We summarize all the 
results from our analyses in Table 5 and discuss the dif- 
ferences in Section 9.1. 

8. SPATIAL MASS DISTRIBUTION ANALYSIS 

Here we present the full 2D mass distribution in the 
cluster and its surrounding structures as constrained by 
WL shear in 8.1, and we further constrain the mass of 
the individual components by modeling identified halos 
independently in 8.2. We present another 2D inversion 
method, SawLenS, in 8.3 which incorporates information 
from WL shear and SL on a multi-scale grid. Finally we 
probe the substructure in the core of the cluster in 8.4. 

8.1. Mass and Light Distributions of the Cluster and 
Surrounding Large-Scale Structure 

35 Since the degree of magnification is locally related to k, this 
will essentially provide the otherwise unconstrained normalization 
of k(0) over a set of concentric annuli where count measurements 
are available. We note that no assumption is made of azimuthal 
symmetry or isotropy of the cluster mass distribution. 

36 The magnification analysis is limited within the central 24' X 
24' region where the number counts of red background galaxies are 
reliably measured. 



WL distortion measurements (g) can be used to recover 
the underlying projected mass density field £(<?). Here 
we use the linear map-making method outlined in Section 
4.4 of Umetsu et al. (2009, see their Section 4.4) to derive 
the projected mass distribution from the -ff^T+Subaru 
distortion data presented in the previous sections. 

In Figure 10 (top left panel), we show the surface- 
density field k(0) in the central 28' x 28' region, recon- 
structed from the background (blue+red) sample (Sec- 
tion 3.2), where for visualization purposes the mass map 
is smoothed with a Gaussian of 6>fwhm = 1-5'. We 
overlay the mass map with contours starting at 2.5c K 
above the background, which is equivalent to a detec- 
tion threshold of £ 2 .5<t k = 3.66 x 10 14 /iM /Mpc 2 , and 
separated by 2o~ K intervals. 

A very elongated structure is evident, and several mass 
clumps seem to comprise the cluster. Some clumps 
lie outside of the estimated virial radius which is at 
~ 2 Mpc/h. Some of these mass structures may in fact 
lie in the foreground of the cluster and not be physically 
associated with it, yet contribute to the overall lensing of 
the background galaxies. We therefore need to estimate 
the mean redshift of each of these significant structures. 

In order to estimate the distances to the detected struc- 
tures, and to correlate the galaxy distribution in this 
cluster and its surrounding structure with the DM distri- 
bution, we utilize the green sample defined in Section 3.1, 
which comprises mostly cluster galaxies and some lower- 
level contamination of foreground galaxies. We construct 
both a 2D galaxy number-density map (Figure 10, top 
right panel) and a if -corrected i?c _ band luminosity den- 
sity map (Figure 10, bottom right panel). Both maps 
are smoothed with a Gaussian of the same scale as the 
mass map above, #fwhm = 1.5' . White contours are 
overlaid with 5a increments starting at 2.5c above the 
background level of the equivalent map. We also overlay 
the surface mass density map (red contours) as deter- 
mined above to illustrate the correlation. 

Overall, the DM distribution is well traced by the 
galaxy distribution, as evident by the overlapping con- 
tours and the proximity of the mass peaks to the light 
peaks. The good agreement seen is consistent with the 
smaller mass halos being in the process of accreting onto 
the main halo. A further detailed analysis of individual 
structures is presented below (Section 8.2). 

8.2. Multi-Halo Mass Reconstruction 

To estimate individual masses of the structures com- 
prising the cluster and its surroundings as seen within the 
scope of the Subaru, ~ 0.25 deg 2 , we now present a multi- 
halo fitting approach. We first identify the most domi- 
nant peaks from the convergence map presented above 
(Section 8.1, Figure 10). These are defined as all peaks 
lying 2.5cr K above the background level, estimated from 
a bi- weighted scale and mean (Beers et al. 1990), respec- 
tively, outside the region of 8' from the cluster center, so 
as not to be biased by the cluster potential. The 2.5<r K 
detection level is indicated as the first red contour in Fig- 
ure 10, corresponding to £2.50- = 3.66 x 10 14 ft.M /Mpc 2 . 

We have detected nine distinct mass peaks, numbered 
z\-z<j in Figure 10 (top left, the peak locations are 
marked with red points) and in Table 5. To help iden- 
tify if these peaks are part of the same structure as the 
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Figure 10. Top Left: linear reconstruction of the dimensionless surface-mass density distribution, or the lensing convergence k(0) = 
E(0)/£ cr ; t , reconstructed from Subaru distortion data. The lowest red contour represents our detection level at 2.5cx K , with further 
increments of 2a K . Red points denote the locations of the 9 significant mass peaks identified above 2.5<r re . Top Right: Surface number 
density distribution E n (0) of green galaxies, representing cluster member galaxies. White contours show 5<r n increments starting at 2.5o- n . 
Also overlaid are the red contours of the convergence, showing very good agreement between the DM and the galaxy distributions. Bottom 
Left: Dimensionless surface-mass density distribution reconstructed from the multi-halo shear-fitting analysis, comprising the NFW halos 
fitted to the 9 main mass peaks detected in 10. The most massive central component was fitted by an elliptical-NFW model, whereas the 
other peaks are fitted by a simple NFW model. The parameters of each halo fit are given in Table 5. Bottom Right: Luminosity density 
distribution S;(0) of green galaxies, representing cluster member galaxies. White contours show 5<t; increments starting 2.5<t; in number 
density. Also overlaid are the red contours of the convergence. 



cluster or unassociated systems projected along the line 
of sight, we estimate the photometric redshift of each 
component. For this we take the mean photometric red- 
shift of matching member galaxies in the green sample 
(section 3.1) lying within 1' from the respective mass 
halo peak. As evident, only five of the nine structures 
lie at approximately the same redshift (22, £3, 25, zg, zg) 
as that of the cluster, whereas the other mass clumps 
lie at zi = 0.29, z 4 = 0.42, z 7 = 0.43, z s = 0.34, in the 
foreground of the cluster. 

We then preform a 2D shear analysis to constrain the 
mass properties of the cluster and its surrounding struc- 
ture, modeled as the sum of 9 mass halos in projec- 
tion space. The multi-halo shear modeling procedure 
described here is similar to those of Watanabe et al. 
(2011), Okabe et al. (2011), and Zitrin et al. (2012b), 
but including an elliptical halo model as described be- 
low. More details will be presented in our forth-coming 



paper (Umetsu et al. 2013, in preparation). 

First we construct a reduced-shear map (gi(G), <?2 (#)) 
on a regular grid of AT cc n = 42 x 42 independent cells, 
covering a 28' x 28' region centered on the cluster. We 
exclude from our analysis those cells lying within 1.5' 
from the cluster center and those lying within 0.5' from 
the other less-massive halo peaks, to avoid potential sys- 
tematic errors due to contamination by unlensed cluster 
member galaxies (Section 3). This leaves us with a total 
of 1813 usable measurement cells, corresponding to 3626 
constraints. 

We describe the primary mass peak, responsible for 
the main cluster, as an elliptical NFW (eNFW, hereafter) 
model specified with 6 parameters, namely the halo virial 
mass (M v i r ), concentration (c V i r ), ellipticity (e = 1 — 
b/a), position angle of the major axis (# e )> and centroid 
position (X C ,Y C ). We introduce the mass ellipticity e 
in the isodensity contours of the projected NFW profile 
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S(X,y) asi? 2 = (X-X C ) 2 + (Y-Y C ) 2 -> (X-X c ) 2 (l- 
e) + (Y-Y c ) 2 /(l~e) (see Oguri et al. 2010; Umetsu et al. 
2012). For each of the other 8 less massive halos, we 
assign a spherical NFW profile parametrized with the 
virial mass (-M v i r ), where the centroid position is fixed 
at the respective mass peak location. Since these less 
massive halos are less resolved by our WL observations, 
their concentration parameters are set according to the 
mass-concentration relation c v i r (M vlr , z) given by Duffy 
et al. (2008). 

We use the MCMC technique with Metropolis- 
Hastings sampling to constrain the multi-halo lens model 
from a simultaneous 9-component fitting to the reduced- 
shear map. The best-fit parameters are reported in 



Table 5. We find virial masses of M v - ir = (1.71 ± 
0.29) x 10 15 M //i for the main cluster halo, z 6 , and 
masses of, M vir = (0.15 ± 0.08), (0.27 ± 0.12), (0.22 ± 
0.11), (0.15 ± 0.08) x 10 15 M Q /h for the other smaller 
halos, Z2, Z3, Z5, zg, respectively, that lie at the same red- 
shift. 

Comparing the cluster luminosity and galaxy maps, 
most of the significant mass structures detected in the 
mass map are also probed by the galaxies, with the ex- 
ception of two lower-significance structures, marked as 
z$ and Zg. The mass structure zs is not evident in the 
galaxy density map, but can be explained by the exis- 
tence of lower-redshift galaxies seen therein, z 8 = 0.34, 
which is close to the limit probed by our color-selection 
method (see Figure 3, green). The other mass structure, 
zg, is not detected in the galaxy density map, however, 
a low-significance peak is seen (more evident in the lu- 
minosity map) offset by ~ 1.5' to the north-east, where 
visual inspection reveals a bright cluster galaxy (BCG), 
albeit also at z ~ 0.34, therefore not part of the cluster 
structure. A ~ 1.5' offset is comparable to our smoothing 
scale, therefore within errors. 

To further determine if the detected and modeled ha- 
los are real, and if their content represents the typical 
stellar content of cluster-sized halos, we calculate the 
mass-to-light (M/L) ratio of the individual halos mod- 
eled. We divide the fitted virial mass of each halo, 
M V i r j, by the total luminosity of "green" galaxies within 
the equivalent virial radius of the halo (virial ellipse in 
the case of the main halo modeled with eNFW, Ze), 
Li = J2ker<r vh - L R C k- We list the M/L ratios in 
Table 6 and we plot the resulting M v [ r /L vs. M v ; r in 
Figure 11. All the halos, apart from halos zg and zg dis- 
cussed above, show values that are expected for group- 
to cluster-sized halos, ranging from sw 200 — 300 (Rincs 
et al. 2000, 2004; Katgert et al. 2004). However, the large 
uncertainties in the M/L ratios measured, especially for 
the low-mass halos, limit us from determining if there is 
any trend in this relation. 

For consistency check, we also calculate the M/L ra- 
tios for the single-halo NFW models we fit to our recon- 
structed mass profiles in subsections 7.1 and 7.2 (listed 
in Table 5), and compare with the M/L ratio we get for 
the main halo from our multi-halo modeling, M/Lq ~ 
230. Within the errors, we get comparable M/L val- 
ues for the one-dimensional NFW fit (subsections 7.1), 
M/L ~ 280, which are reasonable for a cluster as mas- 
sive as MACSJ0717, whereas the values for the NFW fit 
to the 2D mass reconstruction method given in subsec- 
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Figure 11. Mass-to-light M/L vs. M v - lr for the 9 halos modeled 
in Section 8.2. The mass halos that lie at the cluster redshift 
are shown as red stars, and the mass halos at the foreground of 
the cluster are plotted with blue circles. The first two points at 
~ lx 10 14 Mq/Yi correspond to the lower-significance halos that 
have no optical counterpart (see text). 

tion 7.2, M/L ~ 300, are higher in comparison, reflecting 
the higher total mass found, due to the inclusion of the 
second massive halo at larger radii in the virial mass fit- 
ted. 

8.3. SaWLens SL+WL reconstruction 

We preform a complementary joint SL+WL recon- 
struction using the method of (Mcrtcn et al. 2009, 2011, 
hereafter SaWLens). The SaWLens method combines 
central SL constraints from multiple-image systems with 
WL distortion constraints in a non-parametric manner to 
reconstruct the underlying lensing potential on a multi- 
scale grid. Here, the density of constraints sets our res- 
olution scale. Inside the HST region, we combine con- 
straints from HST WL shape distortions with those of SL 
multiple-image sets. This yields a very high-resolution 
scale of 9" box size. Beyond this, we rely on Subaru WL 
shape distortion constraints, yielding a resolution scale of 
25". There is good consistency between the mass struc- 
tures probed by the mass map derived here and in Sec- 
tion 8.1. We further derive azimuthally- averaged mass 
profile, in order to compare with our other mass pro- 
files derived with the methods presented in the previous 
sections (see Figure 13, blue triangles). SaWLens er- 
ror bars were derived from 1000 bootstrap realizations of 
the WL input catalog and from 2000 re-samplings of the 
SL input data. Since many of the multiple image sys- 
tems have only (less certain) photometric redhsifts, SL 
realizations were obtained by randomly assigning red- 
shifts within the equivalent uncertainties of each system. 
We further present the inner substructure seen by this 
method in the next subsection. 

8.4. Mass and Light Distribution inside the Cluster 

Core 

In this subsection, we present the 2D analysis of the 
WL shear map as measured from background galaxies 
in the HST field, defined in section 6. We also present 
the inner region of the SaWLens reconstruction as an 
independent analysis of the cluster core. 
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To avoid systematic effects due to the finite-field of 
the HST FOV, we combine our HST background sample 
with the Subaru background sample (Section 6), where 
we remove Subaru objects that match with HST objects 
to avoid duplication. We also correct for the different 
depths of the HST relative to the Subaru background 
samples, as explained in Section 6. In order to avoid 
infinite noise issues we first smooth the 2D shear field 
with a Gaussian of 0.25' FWHM. This essentially limits 
our sensitivity to mass structure at small angular scales, 
depending on the local source number density (S/N ex 

l/2\ 

We reconstruct the high-resolution mass map in the 
central 5' x 5' region from the 2D shear map, using an 
entropy-regularized maximum-likelihood reconstruction 
method (hereafter, maximum entropy method, or MEM) 
of Umctsu & Broadhurst (2008). We present the result- 
ing mass distribution in Figure 12 (upper left) in terms 
of the reconstruction S/N map, with 3, 5, lo K mass con- 
tours overlaid. The errors for the mass map are based 
on the theoretical covariance matrix (Umctsu & Broad- 
hurst 2008). As an independent comparison, we also 
present the inner region of the mass map derived from 
the SaWLens method, described in the previous section. 
This is shown in Figure 12 (right panel). A grid-size of 
~ 9" is possible in this region due to several SL multiple- 
image systems, augmented by the HST measured WL 
galaxy shapes. We overlay the light distribution in both 
maps (white contours) as determined from the cluster 
members selected in section 6. 

Broadly, both maps show a mass distribution that fol- 
lows the light as determined by the member galaxies. 
Substantial substructure is probed by both methods, and 
several distinct mass peaks are evident throughout. The 
complexity of the mass distribution demonstrates that 
a clear center is not well-defined in this cluster, which 
instead of a dominant BCG has several bright galaxies 
associated with the different mass clumps. This level of 
substructure was previously reported by Ma et al. (2009) 
from Chandra X-ray data, showing several brightness 
peaks (see their Figure 1) and an extended shock feature. 
We overlay the X-ray map (also rederived here in Sec- 
tion 9.2 below) on the HST color image (Figure 12, bot- 
tom right). Combining with dynamical measurements, 
Ma et al. (2009) concluded the existence of 4 main mass 
halos (denoted A, B, C, and D in their figure) suggest- 
ing a triple-merger. One of those clumps (B) was shown 
to still be in its infancy at very high collision velocity 
of s ~ 3000 km s" 1 , infalling through the cluster. Using 
their SL model, L12 also find a similar substructure in 
the dark matter distribution, and attempt to model the 
mass map with 1-5 mass halos, finding a best solution 
given by 4 mass halos. We note L12's peaks on both 
maps (blue crosses, denoted as A-D following Ma et al.'s 
designation) . 

We also overlay the MEM mass contours on the SL 
model derived in Section 5, shown along with L12 mass 
peak locations in the bottom-right panel of Figure 12. 
We note that the WL-MEM results agree very well with 
our parametric SL results (Section 5), both demonstrat- 
ing an overall agreement between the distributions of the 
galaxies and DM, although our parametric SL method 
does use the light distribution as an initial approxima- 



tion of the mass. For the WL-MEM results, this is not 
trivial at all, because the WL-MEM method is entirely 
non-parametric and is not aware of the distribution of 
cluster members, hence providing a model-independent 
reconstruction of the underlying total mass distribution. 

Although we find an overall agreement with previous 
and current SL analyses where the DM follows the lo- 
cation of the galaxies, some differences are noticeable. 
From the MEM-derived mass map, the peak location of 
the main clump "C" is somewhat offset from those of 
cluster galaxies and X-ray emission (that rather agree 
with each other) by about ~ 0.33' « 90kpc//i, although 
their extended features overlap well with each other. 

The SaWLens derived map, on the other hand, shows 
much larger differences between the mass and light distri- 
butions. The mass peaks in the SaWLens reconstruction 
are mostly offset from those of cluster galaxies. Most 
notably, the location of the SaWLens mass peak near- 
est to clump "A" is further displaced to the center than 
the light or the location found by L12, and clump "B" is 
much less pronounced in this mass map. 

The current analysis is limited by the spatial resolution 
set by the number density of background galaxies, and 
may be prone to potential systematics, especially in the 
critical lensing regime. In this work we therefore cannot 
draw any significant conclusion about the level of offsets 
between the DM, galaxies, and hot gas. 

9. DISCUSSION 
9.1. Mass Profiles from Different Lensing Methods 

The mass profiles we constructed in the previous sec- 
tions using various methods are summarized in Figure 13: 
a parametric light-traces-mass SL mass profile (green; 
Section 5), a non-parametric Bayesian mass profile in- 
corporating WL ID shear+magnification constraints and 
the SL mass constraint (red squares; Section 7.1), a non- 
parametric approach combining WL 2D shear+ ID mag- 
nification constraints (black circles; Section 7.2), and fi- 
nally, a mass profile obtained with SaWLens (blue tri- 
angles; Section 8.3), an independent non-parametric re- 
construction combining SL+WL shear constraints on a 
joint multi-scale grid. Here we summarize and compare 
the different results. 

As can be seen, overall good agreement is found in 
the regions of overlap between the different reconstruc- 
tion methods, within the errorbars. The relatively good 
agreement we find on large scales is reassuring, given 
that it is all based on the same Subaru imaging data. 
At r > 1 Mpc/h there is general change in the sign of 
the gradient of the mass profile, where a clear flatten- 
ing tendency is seen. A similar behavior was also found 
in our recent CLASH lensing analysis of the high-mass 
relaxed cluster MACSJ 1206-08 at z = 0.44 (M vir ~ 
1.1 x 10 15 M o /h, Umetsu et al. 2012), albeit to a lower 
extent, and may in both cases reflect the relative promi- 
nence of the surrounding filamentary pattern of struc- 
tures extending beyond the virial radius, and hence the 
relatively less evolved stage of these very massive clusters 
at z ~ 0.5, compared to their lower redshift counterparts, 
such as A1689 and A1703, where the outer mass profile 
continues to steepen (Broadhurst et al. 2005a; Umctsu 
et al. 2011b) and for which no prominent filamentary 
network is visible. 
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Figure 12. Top left: S/N map of the WL MEM reconstructed mass-density distribution. The black contours start at 3<r K with 2cr K 
intervals. We also plot the light contours (white) and the SL mass peak (A— D) found by L12. Top right: The SaWLens SL+WL 
reconstructed mass-density distribution, light contours (white) and L12 SL mass peak. Bottom left: The parametric SL model derived in 
Section 5, using the light from cluster galaxies as a proxy of the mass. The WL MEM mass reconstruction (black contours) and L12 SL 
mass peaks are also overlaid. Bottom right: HST color image of MACSJ0717. Cyan contours show the DM distribution as determined from 
the WL shear analysis. Yellow contours show the light distribution of cluster members. Red contours show the Chandra X-ray Luminosity 
map. The cluster DM component is comprised of several mass clumps, with several offsets between the DM, gas and galaxies. Notably, 
the central DM core lies between the light peaks. 



MACSJ0717 has been recently analyzed by Jauzac 
ct al. (2012) using multiple HST/ ACS pointings aimed 
at covering the filamentary structure around the cluster. 
Their mass reconstruction shows slightly higher densi- 
ties at all radii and most notably at the outer radii, al- 
beit still in agreement with our 2D shear+magnification 
density profile (black circles), within the errors. This 
small systematic difference may arise due to Jauzac et 
al.'s strategy of surveying only the area where the fila- 
mentary structure is dominant, so that the mean lensing 
signal that we obtain with our larger FOV is on average 
expected to be lower, as observed. 

The total masses of the cluster are presented in Sec- 
tion 9.4 where we compare with prediction for the highest 
mass halos expected in the context of the ACDM model. 



9.2. X-Ray results from Chandra and XMM-Newton 

Here we compare complementary X-ray derived total 
masses to our ID lensing-derived total masses. One 
mass estimator uses the assumption that the gas frac- 
tion at r25oo is constant (for this work we assume / gas = 
Mg as /Mtotai = 0.11). The other common mass estimator 
uses the assumption that the primary pressure support 
for the gas is thermal pressure, that is, that the gas is 
in nearly hydrostatic equilibrium (HSE). While it is true 
that this system is merging and highly complex (as shown 
in Ma et al. 2009), a comparison of the HSE mass with 
the lensing mass can be used to place estimates on the 
magnitude of non-thermal pressure support in the gas, 
such as bulk motions and turbulence. 
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Figure 13. Surface mass density profile, combining and com- 
paring results from our parametric SL analysis of ChASH-HST 
data (green), ID WL (shear+magnification) + SL analysis (red 
squares), 2D WL (shear+magnification) analysis (black circles), 
and SaWLens 2D shear+SL analysis (blue triangles). All the WL- 
shear analyses are based on both HST and Subaru datasets, where 
the WL constraints at ^ 3' are based on the Subaru observations. 
Good agreement between the SL and WL analysis is seen in the re- 
gion of overlap. There is also good agreement between the different 
methods of WL analysis. 

We analyze public Chandra observations from a 
single OBS-ID (4200) with a net exposure time of 
59,000 seconds. Gas density profiles from the ACCEPT 
database (Cavagnolo et al. 2008) were used to generate 
an enclosed gas mass profile. We also derived radial pro- 
files based on spectra extracted from concentric annular 
apertures. The tool used for the HSE mass estimate is 
JACO (Joint Analysis of Cluster Observations, Mahdavi 
et al. 2007); we refer the reader to this paper for the de- 
tails of the X-ray analysis procedure. The best-fit NFW 
parameters derived from the joint JACO fit, at r25oo are, 
M 250 o - 4.45JI°;^ x 10 14 M Q /h, c 2500 = 0.6 ± 0.3. The 
X-ray profile has a limited range r < 0.7Mpc//i. 

We compare the X-ray total mass within r < 500 kpc/h 
with lensing-derived values at the same clustercentric ra- 
dius in Table 5. The X-ray mass seems to be about 
- 87% of the lensing ID WL+SL NFW fitted value. In 
Figure 15 we compare the resulting total mass (Msd < r) 
profiles from the X-ray fit (magenta) and the lensing ID 
WL+RE NFW fit (orange). We find that the X-ray 
mass profile derived with the HSE assumption under- 
estimates the lensing profile at all radii. The difference 
is most extreme in the core of the cluster, where the 
HSE X-ray mass is only 45% of the WL mass, rising to 
agreement with the WL mass at the limit of the X-ray 
data, r = 0.7Mpc//i. This inconsistency clearly stems 
from the departure from HSE at the center of the cluster 
where significant merging activity has been claimed, aug- 
mented by the highly non-spherical shape of the cluster 
(Meneghetti et al. 2010; Rasia et al. 2012). We enhance 
our discussion below by further comparing to the SZE- 
derived mass (see Section 9.3) which also relies on the 
HSE assumption. 

For a wider look at the 2D distribution of the 
gas, we also analyze XMM-Newton observations of 
MACSJ0717 recently made public (P.I. Million, ob- 



servations 672420101, 672420201, 672420301, totaling 
145,000 seconds). We perform a standard reduction on 
the observations, removing high flare times. To achieve 
the best sensitivity to gas at large clustercentric radius, 
we summed the MOS1, MOS2, and p-n CCDs of all three 
observation sets. The brightest point sources were iden- 
tified and excluded. Since the PSF for XMM gets fairly 
distorted and elongated at off-axis angles, there is incom- 
plete removal of the halos around point sources there. 

We present the logarithmic X-ray brightness map in 
Figure 14, where we smooth the map with a Gaussian of 
$fwhm = 0.5' and overlay it with logarithmic contours 
starting at 1.5ax above the background with 2ax inter- 
vals (solid black lines). For comparison, we overlay the 
mass density (2.5- and 6.5-ct k ) contours from the Subaru 
lensing map (solid white line) and the galaxy distribution 
2.5cr„ contour (dotted black line). 

The cluster X-ray brightness peak is clearly shifted 
from the mass (and galaxy) peak, at about R — 0.37' = 
100 kpc/h north-west of the optical center. This has been 
noted in previous detailed analyses of the cluster center 
(Ma et al. 2009) and was interpreted to be due to trail- 
ing of the gas component behind the cluster as subclumps 
cross through the main cluster halo, because the gas and 
the DM have different cross-sections. We summarize in 
Table 1 the location of the optical center we have been 
using and that of the X-ray brightness peak. 

With the advent of the XMM-Newton data, we can 
also see here the clear detection of the secondary massive 
component (denoted z 5 in our mass map, see Figure 10) 
at a 5.5(7x level, and the third mass component (denoted 
Z2 in our mass map) at a 2.5cxx level. Finally, the forth 
mass component detected from lensing, Z3, is evident in 
the map but at much lower significance, ~ 1.6a x- Al- 
though lensing shows this halo to be more massive than 
z 2 , it is hard to detect it in the X-ray image since it 
lies at the edge of our data where it is more prone to 
contamination by point sources. 

Overall, it seems the gas in the outskirts (at a scale of 
,$ 0.7Mpc//i) is more spherically-distributed around the 
main cluster component, but has the same substructure 
corresponding to high-mass peaks along the filamentary 
structure. The emission from the hot gas is expected to 
be more spherical in shape than that mapped by lens- 
ing because the gas in projection traces the gravitational 
potential, which is rounder than the density distribution. 

9.3. SZE Results using Bolocam 

We obtained SZE observations of MACSJ0717 us- 
ing Bolocam at the Caltech Submillimeter Observatory. 
Bolocam is a 144-element bolometric array operating at 
140 GHz with an angular resolution of 58" (full- width at 
half-maximum). The data were collected by scanning 
Bolocam's 8' FOV in a Lissajous pattern to produce 
an image with coverage extending to ~ 10' in radius. 
Full details of the observations are given in Mroczkowski 
et al. (2012) and Sayers et al. (2012), and the data were 
reduced using the procedures described in Sayers et al. 
(2011). In addition, we note that in the multiwavelength 
SZE analysis of Mroczkowski et al. (2012), which used 
the same Bolocam SZE data as this analysis, a tenta- 
tive kinetic SZE signal from subclump "B" of Ma et al. 
(2009) is reported at a significance of ~ 2a. Conse- 
quently, we subtracted the best-fit kinetic SZE template 



18 



CLASH: MACSJ0717 




xlO 15 



5 

X [arcmin] 

Figure 14. X-ray map from XMM-Newton data in logarithmic 
scale, smoothed to $fwhm = 0.5', with 1.5cr contours with 2<r 
intervals (solid black lines). Also overlaid are k contours (solid 
white lines) and galaxy density contour (dotted black line). 

of Mroczkowski et al. (2012) from our data, although we 
note that Mroczkowski et al. (2012) were not able to ro- 
bustly constrain a spatial template of this kinetic SZE 
signal, modeling it as a simple Gaussian centered on the 
location of "B" . We then fit a spherical generalized NFW 
pressure (gNFW) model (Nagai et al. 2007) to our SZE 
data allowing both the normalization and scale radius to 
vary while using the best-fit slope parameters found by 
Arnaud et al. (2010). 

We compute the mass of MACSJ0717 from this gNFW 
fit to the SZE data using the formalism described by 
Mroczkowski (2011). In particular, we assume that 
the cluster is spherical and in HSE, although we note 
that neither of these assumptions are strictly valid for 
MACSJ0717 due to its complex dynamical state. The 
resulting SZE mass profile is shown in blue in Figure 15, 
and we note that it is approximately ~ 75% of our 
lensing-derived mass profile. However, the SZE mass pro- 
file is in reasonably good agreement with the X-ray de- 
rived mass profile, which also assumed HSE. We therefore 
conclude that the difference is due to the bias associated 
with assuming HSE. For example, even for relaxed sys- 
tems we expect the HSE-derived virial mass to be biased 
approximately 10 — 20% low on-average compared to the 
true virial mass due mainly to bulk motions in the gas 
(Nagai et al. 2007; Rasia et al. 2004, 2012; Meneghetti 
et al. 2010). Furthermore, in order to gauge the impact of 
the kinetic SZE signal from subclump "B" , we also com- 
puted an SZE mass profile without subtracting the ki- 
netic SZE template (shown in gray in Figure 15). As ex- 
pected, the difference between the two SZE mass profiles 
is most significant in the inner regions of MACSJ0717 
near the center of subclump "B", where the corrected 
mass is ~ 50% lower than the uncorrected one, decreas- 
ing to only < 15% difference at large radius. 

9.4. Comparison with KCDM Predictions 

After reliably estimating the mass of this cluster and its 
surroundings in Sections 7 and 8 we are in the position to 
address the probability of finding such a massive cluster 
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Figure 15. Integrated total mass profiles M3d(< r), derived from 
an NFW fit to WL+SL mass reconstruction (orange), from the 
SZE data after subtracting the best-fit kinetic SZE template found 
by Mroczkowski et al. (2012) (blue) and from X-ray (magenta). 
The SZE and X-ray fits are largely consistent with each other, but 
systematically lower than the lensing derived mass profile at all 
radii, presumably due to biases associated with the HSE assump- 
tion used in deriving both the SZE and X-ray mass profiles. For 
completeness, we also show the mass profile derived from the SZE 
data without correcting for the kinetic SZE signal (gray). 
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Figure 16. PDF of the maximum cluster mass expected to be 
found in a survey with sky coverage of the MACS survey, As = 
22, 735deg 2 , within the redshift interval 0.5 < z < 1.0. The red 
vertical lines represent the 1, 2, 3<r upper confidence levels. The 
blue circle denotes the measured lensing mass of MACSJ0717. 

in our Universe via extreme value statistics (Mortonson 
et al. 2011; Colombi et al. 2011; Waizmann et al. 2012b; 
Hotchkiss 2011; Harrison & Coles 2012). In the work 
of Waizmann et al. (2012b) several high-mass clusters 
were examined (Abell2163, Abell370, RXJ 1347-1145 
and 1E0657— 558) and the cumulative probability func- 
tion of finding such massive systems in their given survey 
area was calculated using general extreme value statis- 
tics. However, they found that none of those clusters 
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alone is in tension with ACDM. In another paper, Waiz- 
mann et al. (2012c) test the probability of the extremely 
large Einstein radius measured for MACSJ0717, previ- 
ously estimated as Be — 55" (for z — 2.5, Zitrin et al. 
2009a) within the standard ACDM cosmology. Although 
they find this system not to be in tension with the 
ACDM expectations, the sensitivity of the Einstein ra- 
dius distribution to various effects (lens triaxiality, mass- 
concentration, inner slope of the halo density profile and 
more) implies that this comparison is much less reliable 
than that using a total lensing mass. To conclude, Waiz- 
mann et al. (2012c) calculate that a galaxy cluster in the 
redshift range 0.5 < z < 1.0 would need to have a total 
mass of at least M 2 oom — 4.5 x 10 15 M Q , where M 2 oom is 
defined with respect to the mean density of the universe 
at the time of collapse, in order to exclude ACDM at the 
3cr level. 

Here we use as the total mass for MACSJ0717 the 
sum of the virial masses of the two components that lie 
within the virial region of the main halo, M v - a (z5 + Ze) ~ 
(1.93 ± 0.31) x 10 15 M Q /h, as determined by multi-halo 
modeling (see Tables 6) which is also highly consistent 
with the values found from single-halo modeling (see 
Tables 5). To compare with Waizmann et al.'s values, 
MACSJ0717 has M 200m ~ (2.9 ± 0.5) x 10 15 M from 
lensing. In Figure 16 we show the probability distri- 
bution function (PDF) of the most massive cluster (for 
details of this calculation see Davis et al. 2011; Waiz- 
mann et al. 2011, 2012b) as calculated for the MACS 
survey area, As = 22,735deg 2 , within the a-priori red- 
shift interval 0.5 < z < 1.0. We overlay the total mass 
estimate of MACSJ0717 as the blue error bar. It can 
be seen that the mass of MACSJ0717 falls slightly be- 
low the 95 per cent quantile. In order to avoid the bias 
in the occurrence probabilities of rare galaxy clusters as 
discussed in Hotchkiss (2011), one has to set the redshift 
interval a-priori. Since MACSJ0717 happens to fall at 
the lower end of the chosen redshift interval, the PDF in 
Figure 16 can be considered as a conservative estimate 
of the true rareness of the cluster such that a more thor- 
ough treatment, as presented in Harrison & Hotchkiss 
(2012), would yield a lower rareness. 

We therefore conclude that MACSJ0717's mass, being 
the largest above z = 0.5, does not pose any tension on 
the standard ACDM cosmology. However, the growing 
number of very massive clusters at high redshifts makes 
it advisable to study the occurrence probabilities of en- 
sembles of massive clusters instead of single ones (see 
e.g., Waizmann et al. 2012a). 

10. SUMMARY AND CONCLUSIONS 

In this paper, we have presented a comprehensive lens- 
ing analysis of the merging cluster MACSJ0717 at z = 
0.5458, the largest known cosmic lens with complex in- 
ternal structures, by combining independent constraints 
from WL distortion, magnification, and SL effects. This 
is based on wide-field Subaru+CFHT u* BV R c i' z' J K s 
imaging, combined with detailed deep CLASH- HST 16- 
band imaging in the cluster core. 

We have obtained an improved SL re-analysis of the 
inner mass distribution from the CLASH HST data and 
confirmed our earlier model (Zitrin et al. 2009a), where 
we now measure the effective Einstein radius of 60 ± 6" 



(z = 2.963) in agreement with previous values. 

The deep Subaru multi-band photometry is used to 
separate background, foreground, and cluster galaxy 
populations using the multi-color selection techniques es- 
tablished in our earlier work (Medezinski et al. 2010; 
Umetsu et al. 2010), allowing us to obtain a reliable 
WL signal free from significant contamination of un- 
lensed cluster and foreground galaxies for this relatively 
high redshift cluster. By combining complementary SL, 
WL distortion and magnification measurements, we have 
constructed a model-free mass distribution out to well 
beyond the virial radius (r vir « 2Mpc/i _1 ), effectively 
breaking the mass-sheet degeneracy. 

By fitting projected NFW profiles to the reconstructed 
mass profiles, we obtain consistent virial mass estimates: 
M vir = (1.94 ± 0.41) x 10 15 M Q /ft, from the joint likeli- 
hood analysis of tangential shear, magnification, and SL 
measurements and M vir = (2.15±0.49) x 10 15 M Q /h from 
the joint likelihood analysis of 2D shear and azimuthally- 
averaged magnification measurements. 

We further preformed a 2D lensing analysis to compare 
the reconstructed total mass distribution with the lumi- 
nosity and galaxy distributions over a wide Subaru FOV. 
The Subaru data reveal the presence of several mass com- 
ponents that lie along a filamentary structure at the clus- 
ter redshift. We modeled the 2D shear field with a com- 
posite of nine NFW halos in projection, corresponding to 
mass peaks detected above 2.5a, including the main clus- 
ter approximated by an elliptical NFW halo. Those mass 
halos, which are part of the z ~ 0.55 structure, seem 
to form a filamentary-like structure that spans 4 Mpc/h. 
The sum of the two halos that lie within the virial ra- 
dius, Af vir (z 5 + z 6 ) « (1.93 ± 0.31) x 10 15 M Q /h, is in 
good agreement with the total mass estimates from the 
single-NFW fits to the two joint likelihood analyses we 
have preformed. We conclude that the four mass halos, 
which are confined to the volume within ss 2r vrr , i.e. the 
turn-over radius, will likely accrete onto the main cluster 
to form an even more massive system by z = 0, as much 
as M vir « 2.5 x 10 15 M Q /h (~ 3.6 x 10 15 M ). 

Utilizing the high-resolution HST-CLASH observa- 
tions, we were able to resolve the inner core region of the 
cluster using two independent non-parametric methods, 
one relying on the WL and one combining both WL+SL 
constraints. Both show a good overall agreement be- 
tween the light and the total mass distribution. How- 
ever, the current spatial resolution is not enough to pre- 
cisely measure the offsets (if any) between DM, galaxy, 
and X-ray centroids. The upcoming "Frontier Fields" 
initiative with HST will target 6 massive lensing clus- 
ters, MACSJ0717 amongst them, and will yield unpar- 
alleled deep and detailed mass maps of this complicated 
merging cluster, allowing us for the first time to set real 
constraints on the collisional nature of DM. 

Using extreme value statistics, we found our mean 
mass estimate of M vlr «2x 10 15 M Q /h does not lie out- 
side ACDM predictions, for which a higher mass of at 
least M v ; r ~ 3.5 x 10 15 M Q /h is required for ACDM to 
be ruled out at the 3<7 level, but instead differs only at 
the 2<T level, so that larger samples of clusters are moti- 
vated by our findings, to provide a more definitive joint 
probability. 

In conclusion, we find MACSJ0717 to be the most 
massive cluster so far detected at z > 0.5, and to 
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exhibit significant substructure not only in the cen- 
ter, but also to be part of a LSS that spans over ~ 
4Mpc//i, accreting nearby clusters themselves as mas- 
sive as fewx 10 14 Mq/H. This prominence of surrounding 
structure that we find in our work, including subclusters 
and filaments together with the ongoing merging activity 
found in the core of MACSJ0717 indicate that this ob- 
ject is relatively unevolved compared to massive clusters 
at lower redshift where relaxed clusters such as A1689 
show little surrounding structures. 
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Table 1 

Properties of the galaxy cluster MACSJ0717 

Parameter Value 



ID MACS J0717.5+3745 

Optical position (J2000.0) 

R.A 07:17:32.63 

Decl +37:44:59.7 

X-ray peak position (J2000.0) 

R.A 07:17:31.65 

Decl +37:45:18.5 

Redshift 0.5458 

X-ray temperature (keV) 12.5 + 0.7 

Einstein radius (") 60 ± 3 at z a = 2.963 

References. - - [1] Ebeling et al. (2001); [2] Ebeling 
et al. (2007); [3] Postman et al. (2012); 

Note. — The cluster MACS J0717.5+3745(z = 0.5458) 
was discovered in the Massive Cluster Survey (MACS) as 
described by Reference [1], and its redshift determined by 
[2]. The optical cluster center is defined as the center of 
the bright red-sequence selected galaxies. The X-ray center 
and mean temperature were taken from Reference [3]. We 
note, the mean temperature can vary by larger than the 
quoted uncertainty between authors since it depends on 
the exact location of the X-ray center, which is different 
between [1] and [3]. 



Table 2 

Subaru/Suprime-Cam + CFHT/McgaPrime data. 



Filter 


Exposure timc a 


Seeing* 3 


mi im c 




(ks) 


(arcscc) 


(AB mag) 


u* 


19.63 


0.93 


26.1 


B 


2.4 


1.01 


26.5 


V 


2.2 


0.95 


26.5 


Rc d 


2.9 


0.72 


26.2 


Ic 


3.6 


0.71 


26.0 


z' 


1.6 


0.58 


25.0 


J 


0.9 


0.73 


22.6 


K S 


0.5 


0.54 


23.0 



a Total exposure time. 

b Seeing FWHM in the full stack of images. 

c Limiting magnitude for a 3<r detection within a 2'' 

aperture. 

d Band used for WL shape measurements. 





Table 3 

Galaxy color selection. 




Sample 


Magnitude limits a N 
(AB mag) 


(arcmin 2 ) 


<^> c 


Red 
Green 
Blue 


21 < z' < 25 

z' < 24 
22.5 < z' < 26 


10490 
1252 
11998 


9.6 

1.3 
11.5 


1.21 

0.49/0.54 

2.23 



a Magnitude limits for the galaxy sample. 

b Mean surface number density of source background galaxies. 

c Mean photometric redshift of the sample obtained with the BPZ 

code. 
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Table 4 

Galaxy samples for WL shape measurements. 



Sample 


N 


n g a 
(arcmin 


- 2 ) 


- 9 b 


MACSJ0717 


C 

COSMOS 


(D ls /D s ) d 
MACSJ0717 COSMOS 


Red 

Blue 

Blue+red 


4856 
4738 
9594 


5.65 
5.6 

11.2 




0.41 

0.42 
0.41 


1.1 
1.89 
1.26 


1.08 
1.56 
1.27 


0.43 0.42 
0.59 0.55 
0.48 0.48 



Note. — 

a Mean surface number density of galaxies. 

b Mean RMS error for the shear estimate per galaxy, <r 9 __ , 

c Effective source redshift corresponding to the mean depth (J3) of the sample. 

d Distance ratio averaged over the redshift distribution of the sample, (/3). 



K) 1/2 - 



Table 5 

Best-fit NFW parameters to non-parametric mass reconstruction 
and comparison with X-ray and SZE masses 



Method 


iXvir 

(Mpc/fc) 


M v i r 

(1O 15 M //i) 


M(< 500kpc/h) 

(10 15 M Q /h) 


Kc a 


M vir /L Rc 
(7iM o /L ) 


X 2 /DOF b 


ID WL+SL C 


1.88 ±0.14 


1 04+0.43 


0.62 ±0.17 


0.002 ± 0.014 


280 ± 59 


10/9 


2D WL d 


1.95 ±0.15 


9 ic+0.52 


0.54 ±0.15 


0.026 ±0.013 


303 ± 65 


16/7 


X-ray 






54+O- 04 
u - 04 -0.08 








SZE 






0.50 ±0.04 









Note. — 

a The constant dimensionless mass-sheet component. 

b The goodness-of-fit, minimized \ 2 over number of DOF. 

c The NFW model was fitted to the mass profile fully reconstructed from WL+SL (see § 7.1), constrained 

by ID WL (shear+magnification)+SL Einstein-radius. 

d here, The NFW model was fitted to the mass profile reconstructed from WL alone (see § 7.2), using WL 

(2D shear+lD magnification), without SL constraints. 



Table 6 

Best-fit NFW Multi-halo parameters of the 2D shear Analysis 



Halo 


AX a 


AY a 


^i b 


, c 


JXvir 


A/ Vir d 


e c 


Bj 


Mvir/L Rc 




(arcmin) 


(arcmin) 






(Mpc//i) 


(1O 15 M //i) 




(deg) 


{hM Q /L Q ) 


zl 


-12.11 


6.23 


0.29 


1.21 


1.39 


0.52 ±0.14 


_ 


_ 


220 ± 60 


z2 


-9.73 


-3.57 


0.56 


1.27 


0.81 


0.15 ±0.08 


— 


— 


250 ± 136 


z3 


-6.37 


-9.17 


0.54 


1.26 


0.98 


0.27 ±0.12 


— 


— 


176 ± 75 


zi 


-5.81 


6.79 


0.42 


1.24 


1.11 


0.33 ±0.11 


— 


— 


337 ±113 


z5 


-4.55 


-3.01 


0.55 


1.26 


0.90 


0.22 ±0.11 


— 


— 


145 ± 73 


z6 


-0.10 ±0.12 


-0.08 ±0.08 


0.54 


1.26 


1.81 


1.71 ±0.29 


0.78 ±0.09 


46 ±3 


230 ± 39 


z7 


2.73 


-7.77 


0.43 


1.24 


1.00 


0.24 ±0.10 


— 


— 


238 ± 99 


z,S 


5.39 


10.85 


0.34 


1.22 


0.75 


0.09 ±0.05 


— 


— 


406 ± 247 


z9 


7.91 


3.15 


0.56 


1.27 


0.80 


0.15 ±0.08 


- 


- 


466 ± 252 



Note. — For the main cluster component {zq), the halo centroid was allowed to vary, and an eNFW model was 
fitted. The concentration parameter was also fitted here, whereas for all other less massive halos the concentration 
was set by the mass-concentration relation given by Duffy ct al. (2008). 

a The centroid position of each halo is given relative to the cluster center (see Table 1) in units of arcminutcs. 

b The median photometric redshift for each halo estimated from the green sample. 

c The photometric source redshift for each halo. In the multi-halo shear fitting process, we assumed a flat prior over 

the range z s ± 0.1. 

d Marginalized bi-weight center and scale locations are reported 

The projected mass ellipticity of our eNFW model is defined as e = 1 — b/a < 1 with a and b the major and minor 

axes of the isodensity contours. 

f The position angle of the eNFW halo major axis is given in units of degrees, measured north of west. 



